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METHOD AND APPARATUS FOR ANALYSIS 
OF CHROMATOGRAPHIC MIGRATION PATTERNS 

I. Background of the Invention 
A. Field of the Invention. 

This invention relates to the field of signal detection and analysis of chromatographic 
migration patterns as commonly applied to mixtures of molecules. More specifically, this 
invention relates to a method and apparatus for signal detection and analysis of 
chromatographic migration patterns as applied to the determination of DNA sequences. 



B. Description of Related Art. 

The ability to efficiently and accurately detect and analyze information-containing 
signals in chromatographic data is important for handling large amounts of data. Such an 
ability is particularly important for projects such as the Human Genome Project, where large 

1 5 amounts of information will be generated which must be analyzed and integrated to produce a 
representative sequence of an entire human genome. To expedite the analysis of DNA 
sequence information, numerous methods have been developed. For example, a U.S. patent 
to Clark Tibbetts (No. 5,365,455) discloses a method for the automated processing of DNA 
sequence data. This patent is incorporated by reference herein in its entirety. The Tibbetts' 

20 method derives information from informative variables obtained from the input data set. 

Such informative variables may include the relative intensities between adjacent signals, the 
relative signal spacing and pattern recognition factors. 

The Tibbetts* method is limited, however, by the quality of the chromatographic data. 
Tibbetts' method relies to a certain extent on the reproducibility of chromatographic data to 

25 train the base identification ("calling") system. The apparatus generating the 

chromatographic data, therefore, needs to be consistent from run to run to avoid retraining the 
algorithm. Because chromatographic data frequently contain background noise and migration 
aberrations which obscure information-containing signals, analyses based on signal spacing 
may produce errors in signal identification. Similarly, because signal intensity often varies in 

30 an unpredictable manner, signal identification based on intensity may also result in significant 
identification errors. 
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A U.S. patent of Thomas Stockham and Jeff Ives (No. 5,273,632) discloses an 
alternate method for base identification using blind deconvolution ("BO"). This patent is 
incorporated by reference herein in its entirety. The method of Stockham and Ives uses blind 
deconvolution to deblur information-containing signals in chromatographic data. This 
5 method, however, is significantly limited in the following manner. First, it relies on data 
derived from scanned autoradiogram image data. Second, the method requires user input of 
the BD filter bandwidth and programmer alterations to various thresholds. Third, the 
Stockham and Ives method does not adequately deal with lane to lane mobility differences. 
Fourth, the insertion/deletion and correction logic was too simple. Fifth, the putative peak 
1 0 detection was based on thresholds, and therefore, could miss band detections when band 

amplitudes dropped below the threshold. Sixth, the method of Stockham and Ives lacked the 
ability to align and merge adjacent sample segments. Finally, that method lacked band 
quality measures useful in automatic data routing and or sequence assembly. 

15 II. Summary and Objects of the Invention 

The present invention includes a method and apparatus for the detection and analysis 
of information-containing signals in chromatographic data. The invention also includes a 
method and apparatus for detecting and sharpening signal peaks in chromatographic data. It 
20 is an advantage of the present invention that a chromatographic data from a wide variety of 
separation processes can be analyzed. Such separation processes include, but are not limited 
to, gel and capillary electrophoresis. 

The present invention includes the steps of preprocessing signal data, reading 
successive sample segments, selecting blocks of high quality sequence and then producing 
25 traces of aligned high quality sequences. It is an advantage of the present invention that the 
chromatographic data may include single fluor samples fractionated in multiple lanes and 
multiple fluor samples fractionated in single lanes. 

It is an object of the present invention to provide a method for preprocessing 
chromatographic data by baseline subtracting background noise. It is an advantage of the 
30 present invention that the method of baseline subtraction may be varied according to the type 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



3 

of chromatographic data being analyzed. It is a further advantage of the invention that sparse 
chromatographic data may be interpolated during preprocessing. 

It is an object of the present invention to read the preprocessed signals in successive 
sample segments. It is an advantage that the sample segment size may be sufficiently large to 
5 provide for rapid and efficient signal analysis. 

It is an object of the invention to provide a method and apparatus for detecting 
information-containing signals which are not uniformly distributed in the chromatographic 
data. This analytic technique uses iterative blind deconvolution to determine band frequency 
in sample segments. It is an advantage of the invention that the filter-band width is 
1 0 automatically varied during iteration to optimally detect the signals in the preprocessed 
chromatographic data. It is a further function of the invention to detect and correct signal 
data derived from chromatographic data which have segments which are short in one or more 
signal types (for example, "band-lite" signals). 

It is an object of the present invention to provide a method and apparatus to detect and 
15 correct for mobility differences. It is a feature of the invention that mobility differences are 
corrected using a Monte Carlo alignment rather than using band position or spacing 
information. It is an advantage of the present invention that the Monte Carlo alignment is an 
iterative process to optimize signal alignment. 

It is an object of the invention to enhance band detection using fuzzy logic. It is a 
20 feature of the invention that band detection is performed using fuzzy logic blocks, each block 
providing a particular method of data analysis. It is an object of the invention that each fuzzy 
logic block may be optimized for a particular analytic function. 

It is an object of the present invention that the invention may optionally provide a 
quality measure for each signal. It is a feature of the invention that the quality measure can 
25 be utilized during subsequent alignment steps. It is an advantage of the invention that the 
quality measure can provide left and right cutoff point to limit subsequent analysis to data 
above a given quality measure. 

These and other objects, features and advantages of the invention will be clear to a 
person of ordinary skill in the art upon reading this specification in light of the appending 
30 drawings. 
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III. Brief Description of the Drawings 

Figure 1 depicts a flow chart for the invented base calling method. 
Figure 2 depicts a flow chart of the preprocessing step of Figure 1 . 
Figure 3 depicts a flow chart of the base reading step of Figure 1 . 
Figure 4 depicts a flow chart of the extra-normalization step of Figure 3. 
Figure 5 depicts a flow chart of the peak detection and refinement step of Figure 3. 
Figure 6 depicts a flow chart of the OmitOkN fuzzy logic block of Figure 5. 
Figure 7 depicts a flow chart of the OKSpMembership fuzzy logic block of Figure 6. 
Figure 8 depicts a flow chart of the OmitOkN Bad Spacing Membership fuzzy logic 
block of Figure 6. 

Figure 9 depicts a flow chart of the OmitOkN Cross Banding fuzzy logic block of 
Figure 6. 



Figure 


1 0 depicts a 


flow 


Figure 


1 1 depicts a 


flow 


1 5 Figure 


12 depicts a 


flow 


Figure 11. 






Figure 


1 3 depicts a 


flow 


of Figure 11. 






Figure 


1 4 depicts a 


flow 


20 Figure 


1 5 depicts a 


flow 


Figure 


1 6 depicts a 


flow 


of Figure 15. 






Figure 


1 7 depicts a 


flow 



logic block of Figure 15. 

25 Figure 1 8 depicts a flow chart of the BaseQual Width Membership fuzzy logic block 

of Figure 15. 

Figure 1 9 depicts a flow chart of the BaseQual Shape Membership fuzzy logic block 
of Figure 15. 

Figure 20 depicts a flow chart of the BaseQual Baseline Buzz Membership fuzzy 
30 logic block of Figure 15. 
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Figure 21 depicts a flow chart of the BaseQual OK Spacing Membership fuzzy logic 
block of Figure 15. 

Figure 22 depicts a flow chart of the Baseline Substraction algorithm of Figure 2. 
Figure 23 depicts a flow chart of the Pre-processing Begin/End Detection of Figure 1 . 

5 

IV. Detailed Description of the Preferred Embodiment 

The present invention provides a method and apparatus for detecting and analyzing 
information-containing signals in chromatographic data. In the preferred embodiment, the 

1 0 invention analyzes chromatographic data from DN A sequence analysis machines employing 
various and sundry imaging techniques, including autoradiograms, four lane-single fluor, and 
single lane-four fluor data. The invention further includes general and dedicated apparatuses 
for performing the invented method. Finally, the invention also includes a kit comprising one 
or more of the following components in combination with the invented method: a DNA 

15 sequence apparatus, signal detection apparatus, information storage devices for preserving 
chromatographic data before, during and after analysis, and output devices for displaying the 
analyzed sequence information. 

For DNA sequence analysis, the invented method takes as input the output from a 
DNA sequencing apparatus and returns the called sequence, aligned traces, and band metrics 

20 for each called base. After each sample segment is read, its called sequence, aligned traces, 
and band metrics are joined to previous read segments. After an entire ladder has been read, a 
final step analyzes each called base's metrics and assigns a quality value. The quality values 
are used to identify the largest block of high quality sequence and establish left and right 
cutoff values. If a "preamble" sequence is available, the base calling software will attempt to 

25 locate the preamble in the called sequence and set the left cutoff value beyond it. Such 
preamble sequences may include primer sequences or known sequences which are to be 
excluded from the collected data. This latter step improves the chance that the sequence 
called by this software would merge with the least amount of human intervention. 

The following sections provide a detailed description of each function of the invented 

30 method. The illustrative embodiments of the invention exemplify the application of the 
useful characteristics discussed below, and further reference to these and other useful and 
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novel features is made in the following discussion of each illustrative embodiment. These 
exemplary embodiments are intended to limit neither the scope of the method and apparatus 
that are needed for performing the invented method. 

Referring to Figure 1. the invented base calling software first performs a 
5 preprocessing step 102 on the input data set 101. Preprocessing can include spectral 

separation, background subtraction and interpolation of input data set 1 01 . The preprocessed 
data set 103 then enters Steps 104-106, which reads successive sample segments of the 
preprocessed data. The sample segments 104 may be any suitable size which provides 
efficient signal analysis. In the most preferred embodiment of the invention, the first segment 
1 0 is 2048 scaniine samples. Subsequent segments are also 2048 samples, with 148 samples 
overlapping the previous segment. The following description is based on the most preferred 
sample segment size, although the scope of the invention is not intended to be limited to that 
segment size. 

Each sample segment 104 is first analyzed to estimate the coarse band spacing. 

1 5 Subsequently, the segment 1 04 is analyzed at second time 1 06 to refine the predicted band 
spacing. The band spacing drives the selection of the reconstruction filter employed during 
blind deconvolution. Band spacing and filter band width are inversely related. Once a 
sample segment of 2048 scanlines is read twice (a refined sample segment) and its band 
spacing measured and normalized for that 2048 scaniine segment, the next sample segment of 

20 2048 samples is read. The next sample segment overlaps the previous segment by 148 
scanlines (or about 1 5 nucleotide bases) to establish the frame and relative positioning of 
adjacent segments. Subsequent segments 104 are similarly processed until the final sample 
segment 1 04 is reached. If fewer than 2048 scanlines are available in the original data set, 
then pseudo-random noise is generated to fill the sample segment to the required 2048 

25 samples. Pseudo-random noise is preferred because sources of non-random noise will cause 
improper processing during the blind deconvolution and alignment steps. 

Once all sample segments have been processed (read twice, normalized and the 
segments aligned), the processed and aligned data is analyzed in three fuzzy logic blocks. 
Fuzzy logic allows multivalued logic to enhance peak detection. By using fuzzy logic, a gap 

30 is "somewhat big," a band is "not so tall." Fuzzy logic also provides logic operators (for 
example AND, OR, NOT). Each fuzzy logic block in the base calling method provides a 
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particular analysis of its data. The logic blocks operate on normalized input data and 
essentially classify each band based on absolute and relative criteria which are based on the 
band's neighboring bands. For example, ftiz2y logic block 108 analyzes each base, its 
upstream context and assigns a quality value to each called band (base identity). Following 
5 the assignment of quality values, fuzzy logic block 108 also identifies the largest block of 
high quality data in the processed and aligned data 107. The right and left cutoff points for the 
high quality data block are recorded and set as left and right cutoff points. The output data 
set 109 includes the finished traces, the called bases with their assigned quality values and the 
suggested left and right cutoff points. Output data set 109 can optionally be visually 
10 enhanced to normalize all bands to about the same band amplitude and to remove the saw 
tooth appearance of the non- visually enhanced traces. 

A. Preprocessing 

Referring to Figure 2, input data set 201 is an 2048x4 trace matrix. The first step is to 
1 5 established Begin and End points by analyzing input data set 201 for the first scanlines 
containing above-background signal and for the last scanlines containing such signal (See 
trace 202a as an example.) Large signal spikes, due to artifacts such as primer peaks, are 
excluded. 

Referring to Figure 23, the Preprocessing Begin/End subroutine identifies the Begin 
20 and End points based on signal amplitude. The Begin and End points define the usable signal 
for subsequent operations. Usable signal typically begins just left of the largest left-most 
signal amplitude (the so called primer peak), and continues until either the end of the sample 
segment data or another region of large signal amplitude is encountered. (The latter peak is 
typically called a biostreptation peak.) More specifically, steps 2302 through 2305 identify 
25 the putative start and end points by breaking the sample segment into zones and determining 
the maximum signal amplitude in each zone. Step 2306 determines whether a second primer 
peak is present. If the second peak is present, Step 2306 sets the Begin point at the second 
primer peak. Steps 2307 and 2308 make final adjustments to the Begin and End points, 
setting the Begin point to the first sample with amplitude below the mean of the first half of 
30 the signal, and setting the End point back 350 samples from end. 
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Referring to Figure 2, the baseline of the Preprocessed Begin/End data 202 is then 
determined at Step 203. A single baseline is established for each fluor of the Preprocessed 
Begin/End data 202. The baseline is subtracted from Preprocessed Begin/End data 202 to 
generate a baseline subtracted data set 203. For example, after baseline subtraction, the 
5 localized data set 207 becomes baseline subtracted data set 208. In a more preferred 
embodiment of the invention, a single baseline is established based on data from all lanes. 
This best determines the baseline beneath a run of poorly resolved bases in one lane. 
Currently available DN A sequence data precludes this embodiment because no two fluors 
reliably a common baseline. 

10 Referring to Figure 22, the baseline can be established by estimating the baseline of 

the Preprocessed Begin/End 2201 . In the working embodiment, each trace lane is processed 
twice using a rising exponential threshold. One pass is made from left to right (establishing 
one baseline) (Step 2202), and the next pass is from right to left (establishing another 
baseline) (Step 2203). By taking the geometric mean of the two baseline approximations a 

1 5 fairly natural subtrahend is produced. (See sample traces 2205 and 2206.) 

To establish a baseline approximation using a rising exponential threshold, a threshold 
is initially set to the lowest point found within the first 10 samples. As each successive 
sample is considered, the threshold is incremented by an exponential which slowly ramps 
upward. When a subthreshold sample is encountered, the baseline between the previous 

20 subthreshold point and the current point is taken to be a line segment between the points. The 
threshold is reset to the new subthreshold sample value and the process continues. If, after 
100 samples no subthreshold point has been found, a 100 point segment of the baseline is 
computed (again, piecewise linear), and the rate of rise of the exponential is increased. The 
exponential is calculated to rise by 1/3 the amplitude of the most recent subthreshold point 

25 over a span of 75 samples. 

Following baseline subtraction, baseline subtracted data set 203 is preferably 
spectrally or leakage separated. This step markedly improves the quality of capillary 
electrophoresis data. For slab gel data with a signal to noise ratio of 2.0 or less, separation 
step 204 significantly improves data quality, such that unreadable data can become readable. 

30 The separation step 204 is preferably performed during preprocessing without user input. 
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For four fluor-single lane data, the baseline-subtracted data set 203 is spectrally 
separated. For single fluor-four lane data, data set 203 is leakage separated. For either 
separation, the separation algorithm 204 builds a characteristic matrix (CHM) which is used 
to perform the separation. For spectral separation, the characteristic matrix captures the 
5 spectral cross-talk ratios in four fluor data. For leakage separation, the characteristic matrix is 
generated from the ratio of leakage from the signal in the center of the lane in question to the 
signal in adjacent lanes. For capillary electrophoresis data, the ratios are measured at a peak 
center For slab gel data, all data points are used to generate the characteristic matrix. 

A separation matrix is calculated according to the formula, 
10 SST^invCCHM'CHM 7 )- 1 

where the columns of CHM hold the ratios for each respective lane. The ratios are 
normalized so that the largest element in each column has a value of one. The result of 
separation 204 is a separated data set 204. 

Processing steps 104-106 are optimally performed on preprocessed data 206 
1 5 containing at least 8 scanlines (samples) per band. To increase the number of scanlines per 
band, a baseline subtracted data set 203 or a separated data set 204 may optionally be 
enhanced to double or triple the number of samples using cubic spline interpolation 205. 

B. Reading 

20 Referring to Figure 3, the exemplified reading step analyzes sample segments 301 of 

2048 scanlines. Each sample segment 301 first undergoes blind deconvolution 302 to cancel 
the effects of an unknown laurentian blurring function and to normalize the amplitudes of the 
traces. Blind deconvolution is described in the U.S. Patent to T.G. Stockham and J.T. Ives 
(No. 5,273,632), which is incorporated by reference herein. 

25 The presently invented method includes the following improvements over the method 

of Stockman and Ives. The first 2048 samples are blind deconvolved with an initial narrow- 
band guess for the filter band width ("FBW") value. The narrow-band guess is made so that 
the initial reading does not overestimate the band density along the sample segment. Given 
the resulting conservative estimate of the band density, a subsequent, more apt FBW is 

30 chosen and the segment is reread using it. The FBW chosen for the second read of each 
segment also serves as the FBW used for the 1st read of following segment. This iterative 
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approach to determining the best FBW has proven invaluable in practice; the band densities 
may vary from about 6 samples/band to about 50 samples/band, and do so not only within a 
given ladder but also from sequencing run to sequencing run. In the preferred embodiment of 
the invention, the method is adaptable to a wide range of acceptable inputs. 
5 The invented method includes a means for selecting the filter band width (FBW) 

during blind deconvolution 302. In the working embodiment median band spacing is mapped 
to a filter band width value using the following equations: 
K = 2*sqrt(ln(0.23)/-0.5) 

FBW = 0.5 + (K/med_band_spacing) * (2048/(2*pi)) 

10 The blind deconvolution step 302 deblurs the signal and normalizes its amplitude. 

Following blind deconvolution, an extra-normalization function 303 adjusts band spacing due 
to mobility differences in the samples. Extra-normalization 303 also corrects for the tendency 
of blind deconvolution to create spurious bands, especially in regions of mono-, di- or tri- 
nucleotide repeats where one or more lanes are band-lite for extended regions. 

15 Referring to Figure 4, extra-normalization 303 corrects two types of artifacts created 

by blind deconvolution- Path 406-410 cancels artifacts created in band-lite lanes. Briefly, the 
blindly deconvolved data set 406 is scanned for band-lite lanes by comparing the relative lane 
signal strengths and the relative lane band frequencies. The proxy used for the lane signal 
strength analysis is the 9T h percentile signal amplitude found in each lane. The proxy used 

20 for the band frequency is the proportion of the signal over which the lane in question has the 
largest signal amplitude. If a lane has less than 15% of the total bands found in all four lanes 
in a sample segment, and if the band amplitudes are low relative to the other lanes, the 
amplitudes of the bands in that lane are attenuated. If the band amplitude is lowest, those 
amplitudes are attenuated by one-half. If the band amplitude is above the lowest, the 

25 amplitudes axe attenuated to three-quarters of the original band amplitude. In contrast, in 

ideal sequence data, where A, G, C, and T are equal in frequency, each trace should dominate 
25% of the time. 

Extra-normalization path 401 -403 corrects for mobility differences between lanes and 
performs the actual band-lite attenuation 404. Briefly, the blind deconvolved data set 401 is 
30 analyzed to identify any regions with inordinately large or coincident bands. These regions 
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are set to zero (base-line). If these regions were not set to zero the Monte Carlo alignment 
algorithm 403 would produce an aberrant alignment which focused on separating them. 

Referring to Figure 14, mobility shifts are most noticeable in the autoradiogram and 
single fluor-four lane data particularly near the edges of the gel. The prior method of 
Stockham and Ives described an algorithm which attempted to align the lanes by driving the 
band spacing to as nearly a uniform value as possible. This approach was limited because, 
without proper alignment, many true bands would go undetected because they were shadowed 
by other bands. The algorithm attempted to normalize spacing between detected bands, yet 
the algorithm knew of only a simple majority of the bands present in the data. 

The present invention uses an algorithm which does not use band position or spacing 
information. Instead, the present invention seeks to maximize the integral of the "envelope" 
of all four lanes of data when they share a common baseline. Alignment is accomplished 
using a Monte Carlo search of a 3D space, where the x-axis defines the relation between the 
A and G lanes, the y-axis defines the relation between the AG relation and the C lane, and the 
z-axis defines the relation between the AGC relation and the T lane. An initial set of possible 
alignments are chosen, each triple is applied to the traces to be aligned, and the integral of the 
resulting envelope is calculated. A subset of the triples, those yielding the largest integrals, 
are then refined. The triple which yields the lowest integral is removed from the set under 
consideration. It is replaced by a triple which results from a random alteration of the triple 
which yields the largest integral. When either a maximum number of iterations has occurred 
or the variation within the set of high integral triples has reached a suitably low value, the 
highest yielding triple is chosen as the alignment vector for the segment under consideration. 

More specifically, the search is conducted in a three dimensional space, where the x- 
axis specifies the offset between trace, and trace 2 , the y-axis specifies the offset between the 
trace u registry and trace 3 , and the z-axis specifies the offset between the trace, , 3 registry and 
trace 4 (See illustration 1401). The algorithm employed was originally described by 
W.L.Price in The Computer Journal Vol 20, No. 4, which is incorporated by reference 
herein. 

Initially, a set of putative alignment solutions 1401 is generated. The addresses of the 
lattice points of 6 concentric cubes centered about a point in the space are used as the initial 
alignment solutions. The first time the procedure is used the central point of the concentric 
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cube lattice is the origin (x^O^-O^K)). Subsequent calls can either continue to center the 
lattice on the origin, or they can bias the search by centering the lattice on the previous 
alignment solution (x,,.,^,,^.,). 

Each alignment guess is converted into a shift vector of four values, wherein one 
5 value is 0. Each trace in the matrix is shifted by the amount specified in the shift vector, the 
envelope of the shifted traces is obtained (the maximum value of the four trace values found 
at each position along the traces), and is summed. The sum represents the integral of the 
envelope produced by the alignment guess. A low integral value represents a poor alignment 
(see, e.g. illustration 1402, where the bands are aligned behind others, not arranged "shoulder 

1 0 to shoulder"), whereas a high integral value corresponds to a good alignment (see, e,g. 
illustration 1407, where all bands are fully exposed, arranged "shoulder to shoulder" ). 

Once all alignment guesses have been evaluated, the worst alignment solution is 
replaced by a small, random perturbation of the best alignment solution 1 405. The new 
alignment solution is evaluated, and the process repeats, replacing the new worst alignment 

1 5 with a perturbation of the new best alignment. Eventually, the set of points in the 3D space 
converge about the best alignment solution 1406. 

Referring to Figure 3, following extra-normalization 303, Step 304, peak detection 
and refinement, occurs. The aligned traces then undergo putative peak detection. Referring 
to Figure 5, putative peak detection 502 is performed on the blind deconvolved, extra- 

20 normalized data set 501 (unstopped, attenuated and with the relative mobilities corrected). A 
trace envelope is first determined. The Stockham and Ives Patent described detecting peaks 
in each trace separately with thresholds derived from the underlying data. In the invented 
method, the trace envelope is peak-detected and no thresholds are employed. A peak is 
liberally defined to be a sample which is taller than either of its two neighbors. Subsequent 

25 processing culls this liberally defined putative peak list. This form of peak detection is both 
faster (one trace instead of four) and less prone to error (no subthreshold peaks). In contrast, 
the Stockham and Ives Patent required individual trace peak detection because its alignment 
algorithm attempted to determine lane alignment using peak location information. 

To identify errors in putative band detection 502, including insertion errors, each 

30 putative peak's instantaneous spacing, cross banding, height, and spacing to adjacent bands is 
measured (Step 503). These observed band spacing measurements are fit with a quadratic 
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curve. This quadratic fit is used as the expectation of the band spacing along the entire read 
segment. This approach to defining the expected band spacing is sufficiently general to 
handle segments where, as in the Stockham and Ives Patent, the average spacing is an 
adequate expectation, as well as segments where the spacing changes radically. In the 
invented method, more information was found necessary to sufficiently identify insertions 
and regions of deletions, and as a result, the invented method can resolve a series of insertions 
and deletions. 

The first of three fuzzy logic blocks 504, OmitOkN Fuzzy Logic, is then used to 
identify bands which are most likely insertion artifacts of the band detection process. This 
block classifies the detections as OK, AMBIGUOUS or OMIT. The putative bands given the 
OMIT classification are removed from the putative peak set. Referring to Figure 6, each band 
has several of its attributes 601 examined by this first logic block. If a band is where it ought 
to be with respect to either of its neighbors, then variable okSp is set "TRUE" (Step 602). 

Referring to Figure 7, the intent of the membership function for the OmitOkN Ok 
Spacing fuzzy logic block is to "accept" a spacing measurement which is an integer multiple 
of the expected spacing. Consequently, the observed spacing is normalized to a value on the 
interval [0..1] using its relationship to expected spacing (Step 702). In the example given in 
block 702, the normalized spacing of 0.3 is found to be OK with a truth value of 0.7 (Step 
703 and Example 704). Given the vagaries of band migration, compressions, band shape 
(hence band peak position), and other factors, a peak spaced 17 from its neighbor when the 
expected spacing is 13 is neither ideal nor terrible. 

Referring to Figure 8, for the OmitOkN Bad Spacing fuzzy logic block, the intent of 
the membership function is to "deprecate" a spacing measurement which is not an integer 
multiple of the expected spacing. Consequently, the observed spacing is normalized to 
interval [0..1] using its relationship to expected spacing (Step 802). In the example given in 
Step 802-03 and Example 804, a normalized spacing of 0.3 is found to be BAD with a truth 
value of 0.5.; this spacing is not as good as it could be. 

If a band is not where it ought to be with respect to either of its neighbors then 
variable abSp is set "TRUE" (Step 603). If the amount of "cross banding" (i.e. the amount of 
competition by two bands for a particular region of the read segment) is high, then variable 
badXb is set "TRUE" (Step 604). Similarly, if there is negligible cross banding then variable 
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neglXb is set "TRUE". Referring to Figure 9, cross banding designates the amount of 
competition for the scanlines underlying a detected band. Bands of a dubious nature have 
wide ranging cross band ratios due to their apex's proximity to the baseline* However, 
compressions and stops, with significant amplitudes, can have their cross banding measured. 
5 The cross banding membership function is best used in identifying OK or AMBIGUOUS 
bands. In the diagram provided (Example 901), the first complex has two bands vying for the 
same location, with the second largest band having one-half the amplitude of the largest. The 
cross banding ratio (Step 902) is the amplitude of the largest band divided by the amplitude 
of the next largest band, or in this case Xb aE 2.0. In the second complex, where one band is 

1 0 clearly the band of choice, this ratio approaches infinity. In the example given in Step 903, 
with a cross banding ratio of 1.5, the badXb membership is 0.25, while the negligibleXb 
membership is 1.0; in other words, while a ratio of 1 .5 is found negligible, the band 
legitimacy will be questioned. 

The band height is also categorized as either tiny or ok (Step 605). Referring to 

1 5 Figure 1 0, for the height membership functions the membership sets are best customized for 
the general signal quality one observes from the machine providing the data. In the working 
embodiment a function of the median value of amplitudes measured where bands intersect 
determines the height membership function break points. In particular, the tinyHt function 
breaks at 0.4*medjntersect_pt and is zero by l.l*med_intersect_pt Similarly, the okHt 

20 function comes off zero at 0.5*medjntersectj)t and flattens off at 1 .0 at 

1.5*med_intersect_pt, The blind deconvolution process normalizes band amplitudes to 
interval [0.. 1 ], with most bands having a height in excess of 0. 1 , This example given is 
typical in that it begins deprecating a band based on its height when the height falls below 
0.07. In the example given in Step 1002, the measured band height is 0.1 and has 

25 membership in okHt of 1 ,0 and in tinyHt of 0.0. The band has, per this example of the sets, 
sufficient height. 

These six variables then serve as input to fuzzy combinational logic. A significant 
advantage of fuzzy logic is that it works with and can resolve contradictions among rules 
involving these variables. Bands classified as OK are those with negligible cross banding and 
30 either OK height or OK spacing (Step 606). Bands classified as ambiguous exhibit bad cross 
banding and either OK height or OK spacing and little height (Step 607). Ambiguous bands 
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are typically those where the band is correctly positioned with sufficient amplitude but 
significant cross banding (Step 607). Bands classified as clear insertions, and therefore to be 
omitted, are characterized by negligible height (Step 608). Cross banding is not considered 
when deciding whether a band should be omitted because usually insertions are made very 
5 close to the baseline where cross banding measurements are unreliable. 

The strength of the rule firings is then used to scale the output sets (Step 609). In the 
example given (illustration 610), the output set OK is scaled with amplitude 1 .0, output set N 
(ambiguous) is scaled with 0.25, and set OMIT is scaled with 0.0. Defuzzification, or 
obtaining a crisp (conclusion) value from the output rule sets, is achieved by calculating the 
1 0 centroid of the resultant "masses". In the example, the conclusion reached is that the band is 
OK (Step 611). 

Referring to Figure 5, following fuzzy logic block 504, each peak's instantaneous 
spacing, instantaneous band width, spacing to its left neighbor (left spacing), band width and 
called bases is rerneasured (Step 505). These observed band spacings are fit with a quadratic 

1 5 curve which then serves as the expected spacing along the read segment. Similarly, the 
observed band width measurements are also fit with a quadratic curve which serves as the 
expected band width along the read segment. 

The second fuzzy logic block 506, GapCheck Fuzzy Logic, then identifies bands, or 
gaps between bands, where one or more bands may need to be inserted to achieve the band 

20 spacing predicted by the quadratic fit. This block classifies the detections as NORMAL, 
SPLIT or SUFFERING FROM UPSTREAM TURBULENCE. The gaps are split and a 
suitable number of bands are inserted (Step 507). The bands given the SPLIT classification 
are split a suitable number of times, with the division points being the centroid of the interval 
to be split. The centroid is used to place the insertion on the shoulder of a poorly defined 

25 band, and not in the bottom of the trough between the SPLIT band and its left neighbor. 

Depending upon the size interval, and the expected band spacing, one or more insertions may 
be made. Each insertion has a defined Begin, Middle and End scanline value. 

Referring to Figure 1 1 for more detail, each band pair considered by fuzzy logic block 
GapCheck has several attributes which are measured (Step 1101). In particular, the expected 

30 spacing curve, expected width curve, band width, left gap (gap to the leftmost neighbor) and 
sequence is determined. The upstream sequence is assigned a measure of GC-richness (Step 
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1 102). These measurements, coupled with the GC richness of the sequence of the 5 bands to 
the left, are informative in identifying bands which need additional bands added to their left. 
The gap is normalized with respect to the expected spacing onto interval [-1 ..inf] (Step 1 103). 
Unlike the OmitOkN logic, where the logic determines if a band is located where it should be 
S (independent of its absolute spacing but focusing instead on how far off the spacing curve it 
is), in the GapCheck logic block, the concern is on the absolute distance of the band from its 
left neighbor. If the gap is an integer multiple of the spacing curve (say three spaces from its 
left neighbor) two bands are inserted to its left to establish the proper spacing. In addition to 
the gap between bands in the pair, this logic also considers the widths of the bands. Band 

10 width is normalized onto interval [-1 ..infj (Step 1 104). Usually, when band resolution 
decreases and a region in the observed trace contains fewer peaks than are required, one or 
both bands in the pair is wider than it should be. The gap between the bands can be marginal 
and the band width can be the determining factor. Finally, large gaps and band widths should 
be viewed less aggressively in the presence of upstream GC-richness. 

1 5 The normalized left-gaps of each band in a band pair are classified as big (Step 1 1 05), 

medium (Step 1 106) or small (Step 1 107). Figure 12 provides details of the GapCheck band 
gap membership function. Briefly, the membership function characterizes an observed gap 
measurement (ogp) if it differs from expectation (egp). The gap is measured between B n and 
(Step 1201). The observed gap is normalized to interval 

20 [- 1 ..inf] with the equation: ngp = ogp/egp - 1 .0. Referring to the example in Step 1 203, a 
normalized gap of 0.1 is found to have 0.0 membership in all sets; that is, the gap meets 
expectations and is neither small, medium nor big (Step 1203). 

Referring to Figure 1 1, in Step 1 108 the normalized widths of each band in a band 
pair are classified as big. Figure 13 provides details of the GapCheck band width 

25 membership function. This membership function characterizes an observed band width 
measurement (owd) if it exceeds expectations (ewd). The width is measured between B n 's 
Begin and End points (Step 1301). In the example given in Step 1303, a normalized gap of 
0.2 is found to have membership in Big Width of 0.2; the band is not that wide, but it is wider 
than expected (Step 1304). 

30 Referring to Figure 1 1 , in RULE NORM (Step 1 1 09), band n is not marked as needing 

its left gap split if any of the following are TRUE: 
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a) there is a large gap (bigGapJ but the upstream sequence is GC-rich, or 

b) the gap to the first band in the pair is small (smlGap o0 ) and the two bands are not 

wide (ibigWid,, and IbigWid^) (i.e., ignore the gap between the bands), or 

c) the gap between the two bands is not large (!bigGap n ). 

5 In step 1110, RULE SPLIT marks band„ as needing its left gap split if either of the 

following are TRUE: 

a) the gap between the two bands is large (bigGapJ and the first and/or second band is 

wide (bigWid,,., or bigWidJ or, 

b) the gap between the two bands is large (bigGapJ and the gap left of the first band is 
10 not small (!smlGap n .,) and the upstream sequence is not gc-rich (Igcrich). 

RULE SPLIT (a) detects the combination of a wide and normal band (in either order) 
while RULE SPLIT (b) selects a run of wide bands separated by large gaps. The strength of 
the rule firings is then used to scale the output sets (Step 1111). In example 1 1 12, the output 
set NORMAL is scaled with amplitude 1 .0 and output set SPLIT is scaled with 0.25. The 

15 conclusion is formed by calculating the centroid of the resultant "masses". In example 1112, 
the conclusion reached is that the band is NORMAL. 

To identify putative peak insertion errors, Step 507 remeasures the cross banding, 
instantaneous spacing, band height, band amplitude, and the spacing (left and right gaps) to 
adjacent bands. The observed band spacing measurements are fit with a quadratic curve. 

20 This quadratic fit is used as the expectation of band spacing along the entire read segment. 
The OmitOkN Fuzzy logic block (Step 508) is then used to identify bands which are most 
likely insertion artifacts of the band detection process. Any and all such bands are removed 
from the putative peak set. Newly proposed insertions may be deleted in this step. The fuzzy 
logic band refinement stage adds the important advantage of reducing insertions and deletions 

25 and preventing arbitrary band calling when the reader encounters two or three base regions of 
signal dropout. See Figure 6 and the accompanying text for details of insertion detection. 
The set of putative peaks which survive this processing are recorded as the bands for the read 
segment under consideration (Step 509). 

30 C. Processing and Alignment 
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Referring to Figure 1, reading function 104-106 consecutively processes sample 
segments until all of the input data set 101 is analyzed. Because each sample segment 104 
overlaps the previous sample segment by a predetermined amount, the relative positioning of 
each read and aligned sample segment 106 is known. Step 107 assembles all of the read and 
5 aligned sample segments 106 to form a processed and reassembled sample segment 107. 

D. Post-Processing Editing 

In the working embodiment of the invention, a final process analyzes the set of 
measured band features with a third fuzzy logic block, BaseQual fuzzy logic block 1 09. This 

1 0 block assigns a quality measure to each called band. This block evaluates each band based on 
the band height, width, shape, left and right gap, cross-banding and baseline "buzz." This 
quality value, on the interval (0.0 to 1.0) can be used during subsequent sequence 
alignment/merging steps. The present invention uses the quality value to select the longest 
block of high quality sequence to be considered for alignment and merging with other 

1 5 sequences into a large DNA sequence. The algorithm that selects the left and right cutoff 
points generates a surface, with the x-axis labeled MOVING AVERAGE FILTER WIDTH, 
the y-axis labeled THRESHOLD, and the z-axis labeled READ LENGTH. The quality 
values are filtered with six moving average filters, and the filtered data is compared against 
nine thresholds. The longest contiguous block of above threshold filtered quality values 

20 provides the read length value for the surface for a particular filter width, threshold pair. 

Finally, this surface is scaled so that narrow filter and high threshold read lengths are favored 
over wide filter and low threshold read lengths. The surface maximum z-value is then chosen 
as the read length, and the associated first and last above threshold filtered quality value 
indexes serve as the left and right cutoff points, respectively. If a "preamble" sequence was 

25 submitted to this EDIT stage, and if the sequence is found beyond the established left cutoff 
point, the cutoff point is moved further left to exclude the "preamble" sequence. 

Referring to Figure 15, the BaseQual fuzzy logic algorithm assesses the quality of the 
called bases. Experience has shown that some sequence assembly algorithms fail to assemble 
sequences containing regions of incorrect sequence and that others can only succeed when 

30 each base is accompanied by an indication of its quality (or inversely, its probability of error). 
In the former case, if the incorrect sequence regions are masked from consideration by the 
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assembly program, the bulk of the good sequence will successfully assemble. In the latter 
case, if the low quality regions are identified, the overall base caller product will assemble. In 
either case, incorrect sequence, encountered in isolation, can be and usually is identified by an 
experienced technician using visual inspection. That process is time consuming and 
5 monotonous and subtle errors may go undetected. In general though, incorrect base calling is 
done where the underlying data traces are marginal. 

The BaseQual routine, automates quality assessment by measuring and analyzing 
multiple features (Step 1501) of each called base. Fuzzy logic is used to identify certain band 
presentation patterns and assign levels of quality to them. These band features include the 
1 0 band height, cross-banding, band width, band shape, the band's small gap and the band's 
large gap. 

Band height variations are informative in many of the classifications* Six fuzzy 
variables are used to classify a band's height (tiny, small, moderate, normal, tall and 
collectively, OK) (Step 1 502). Referring to Figure 16 for details of the BaseQual height 

1 5 membership functions, the membership function characterizes an observed band height 

measurement. A band with a "tiny" or "small" height is usually suspect, with the tiny bands 
being more suspect that the merely small. Moderate height bands, and tall bands, also require 
scrutiny. Tall bands are suspect because usually they are found amid stops, compressions, 
and, on slab gels, artifacts. In the example given in Step 1602, a band height of 0.18 is found 

20 to have membership in NormalHeight of 1 .0, which is to say that the band's height is within 
tolerances (See Example 1603). 

Referring to Step 1503, cross banding, the measure of competition by two traces for 
the same region of the trace, is also informative. Referring to Figure 1 7, the BaseQual cross 
banding membership functions characterize an observed band's cross banding measurement. 

25 The cross banding measurement is the ratio of the dominant trace to the next dominant trace. 
Ratios above 1 .5 are deemed to have negligible cross banding, whereas those with lower 
ratios (with 1 .0 being the lowest ratio possible) are suspect. Referring to the example in 
Figure 1702, a cross banding ratio of 1.35 is found to have membership in negligibleXb of 
0.33 (and 0.67 in the negation, InegligibleXb). Referring to Step 1504, band width 

30 (normalized based on a quadratic fit of observed band widths), is another informative 

variable. In Figure 18, the BaseQual band width membership function, a band's observed 
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width is normalized with respect to the expected band width. The intent of the membership 
function is to determine how normal the normalized band width is. Referring to the example 
in Step 1 802, a normalized width of 0.2 has membership in the Normal set of 1 .0 (See 
Example 1 803). Referring to Step 1 505, the band shape, the linear correlation 
5 coefficient between the coefficients of a quadratic fit of the band and the coefficients of a 
quadratic fit of an ideal band, identifies abnormally shaped bands. The BaseQual band shape 
membership function is informative in determining the quality of the base call. The range of 
band heights and widths observed in a run varies considerably. In one embodiment, sample 
rate conversion normalizes the observed band width, the band amplitude was normalized to 

10 1 .0, and the result was then compared against an ideal, gaussian bell-shaped band. The 

approach is computationally expensive and much information regarding the observed shape is 
discarded through the morphing process. 

In a more preferred embodiment, each band's height values are fit with a quadratic 
curve. Similarly, an ideal band shape is fit with a quadratic curve. (The ideal band shape is 

15 defined to have normal height and the expected width.) This approach reduces each sample 
set to an equal number of points. The shape metric is taken as the linear correlation 
coefficient of these two sample sets. Experience has shown that a band's shape is "abnormal" 
if this shape metric falls below 0.5. Referring to the example in Step 1902, a shape metric of 
0.6 is found to have membership in GoodShape of 1 .0 (See example 1903). 

20 Referring to Step 1 506, "baseline buzz," defined as the ratio of two other ratios, helps 

identify regions of the trace (usually the ends) where there is competition by several traces for 
the called band's domain. Toward the margins of a trace the baseline can often become quite 
busy, and when it does the quality of the underlying data, and the reads made thereon, 
become suspect. Baseline buzz can result from either incorrect signal processing, or from 

25 underlying data being so erratic as to defy correct signal processing. In either case the called 
sequence should come under suspicion. Referring to Figure 20, a buzz measurement above 
0.2 begins to signal a problematic sequence. Referring to the example is step 2003 and 
Example 2004, a buzz measurement of 0.28 has membership in okBuz of 0.63 (and 0.37 in 
the negation, lokBuz). In this case, the band's quality has come into question. 

30 Referring to Step 1 507, the gaps to a band's left and right neighbor are further 

informative variables in assessing band quality. These measurements help identify bands 
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that, despite all previous efforts to the contrary, remain positioned too close or too far from a 
preferred position. Referring to Figure 21 for details of BaseQual band spacing membership 
functions, a band's observed spacing is normalized with respect to the expected spacing. The 
intent of the membership function is to determine how normal the normalized band spacing 
is. In the example given in blocks 2102 and 2103, a normalized spacing of 0.2 receives an 
unqualified OK, with membership in OK Spacing of 1 .0. 

Referring to Step 1508, logical combinations of several variables (e.g. buzz, width, 
shape, and spacing) help keep the rules for assigning a quality value to a band tractable. 
Variable I Bad indicates that one of the measures was out of tolerance. Similarly, variables 
2Bad, 3Bad* and 4Bad indicate that two, three, or all four measurements are out of tolerance. 
Finally, variable 40k notes that all four measurements are in tolerance. 

Subsequently, a quality assessment is determined through application of a series of 
nine rules. In RULE QUAL10, the lowest quality assessment is made of bands that are tiny 
in height and are incorrectly positioned (Step 1509). For a band which matches this rule to 
some degree, the rule will assign a nonzero scale value to the output set with centroid near 0 
(Step 1519). The second quality assessment, RULE QUAL20, is made of bands that are 
short, show signs of cross banding, and are incorrectly positioned (Step 1510). For a band 
which matches this rule to some degree, the rule will assign a nonzero scale value to the 
output set with centroid near 13 (Step 1519). 

The third quality assessment, RULE QUAL30, is made of bands which are tiny in 
height yet correctly positioned (Step 1511). A band which matches this rule to some degree 
will be assigned a nonzero scale value to the output set with centroid near 25 (Step 1519). 
The fourth quality assessment, RULE QUAL40, is made of bands with small or moderate 
height and which show signs of cross banding or have 3Bad or 4Bad attributes (Step 1512). 
A band which matches this rule to some degree will be assigned a nonzero scale value to the 
output set with centroid near 38 (Step 1519). 

The fifth quality assessment, RULE QUAL50, is made of bands with small or 
moderate height and which show either some degree of cross banding or have 2Bad or 3Bad 
attributes (slightly better than quality class 4 in that these might have one less bad attribute) 
(Step 1513). A band which matches this rule to some degree will assign a nonzero scale 
value to the output set with centroid near 50 (Step 1519). RULE QUAL60, the sixth quality 
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assessment, is applied to bands with OK height but which show signs of baseline buzz, non- 
negligible cross banding, or have 2Bad attributes (Step 1514). A band which matches this 
rule to some degree will assign a nonzero scale value to the output set with centroid near 63 
(1519). 

5 Bands which have higher degrees of quality satisfy the seventh to ninth quality 

assessments. The seventh quality assessment, RULE QUAL70. is made of bands in one of 
three general classes. (Step 1515), One class of bands has OK height, little baseline buzz, 
negligible cross banding, but has 2Bad attributes. Another class of bands shows negligible 
cross banding, has OK height, no baseline buzz, is correctly positioned, but has both 
10 abnormal width and bad shape. (This class, named runfiL is characteristic of a band inserted 
within a poorly resolved run of bands). The final class of bands has 40k attributes but has 
small height and possibly some degree of cross banding present. A band which matches this 
rule to some degree will assign a nonzero scale value to the output set with centroid near 75. 
(Step 1519). 

1 5 The eighth quality assessment, RULE QU AL80, is made of bands with OK height, 

little baseline buzz, negligible cross banding, but IBad attribute. (Step 1516). A band which 
matches this rule to some degree (many do) will assign a nonzero scale value to the output set 
with centroid near 88. (Example 1519). The top quality assessment, RULE QUAL90, is 
made of bands with absolutely nothing visually wrong with them (Step 1517). A band which 

20 matches this rule to some degree (again, given good quality input, many do) will assign a 
nonzero scale value to the output set with centroid near 100 (Step 1519). Finally, as with all 
the other fuzzy logic blocks, the output sets are scaled with the strength of their respective 
rule firings, and the centroid is calculated to determine the final quality assessment. 

25 E. Final Sequence Assembly 

The final quality assessments from the BaseQual Fuzzy Logic analysis control the 
length of the final sequence 109. Where high quality sequence data is desired, the quality 
assessments determines may limit the read length of the final sequence. Where longer read 
lengths are desired, and lower quality sequence is acceptable, the quality assessments can aid 
30 is correlating the resulting sequence data from other sequence analysis. For example, when 
overlapping sequences are obtained, the quality assessments can determine which base calls 
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are more reliable. Similarly, when both strands of a DNA sequence are available, the quality 
assessments aid in identifying higher probability base calls. 

F. Computer Implementation of the Base Calling Software 

5 

The invented Base Calling Software can be implemented on standard desktop 
computers, such as Pentium- and 486-containing PC's. Computers with less powerful 
processors are also suitable, although the overall processing time for each input data set will 
be slower. Such computers will preferably include at least a central processing unit, dynamic 
1 0 memory and a device for outputting processed information. The invented base calling 
software can be stored on any suitable storage media, including computer diskettes, 
removable media, hard-drives, CD's, magnetic tapes and similar electronic storage means. 
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/********************************** t*+*4^*******t****t*++*+*^**** 

* FILE: AboutBQ.hxx 

* AUTHOR: Andy Marks 
*/ 

5 #if!definedC_ABOUTBQ_HXXj 

* define _ABOUTBQ_HXX_ 
#ifdefined(WIN32) 

class _declspec( dllexport ) AboutBQ 
#else 

10 class AboutBQ 
#endif 
{ 

public: 
AboutBQO; 
15 -AboutBQO; 

char const* productName() const; 

float productVersion() const; 

int numProceduresO const; 

char const* procedureName(int pdx) const; 

20 }; 

#endif 

y*«#4< ******************** ***4>*%*4>** *********** ************ 

* FILE: AboutBQ.cxx 
25 * AUTHOR: Andy Marks 

*/ 

#include <basecall/AboutBQ.hxx> 
static char const* plut_[] = 
{ 

30 "Base Calling With Quality Metrics", 
"Set Spectral Separation Matrix", 
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"Set Read Start And End Scanlines", 
"Illegal Procedure Index" 

}; 

#define NPS (sizeof(plutJ/sizeof(plut_[0])) 

AboutBQ::AboutBQ() 

{ 

) 

AboutBQ::~AboutBQ() 

{ 

} 

float 

AboutBQ::productVcrsion() const 

{ 

return 0.995f; 

} 

char const* 

AboutBQ::productName() const 
{ 

static char const* prod = M BaseQual: Cimarron Software Inc."; 
return prod; 

} 

int 

AboutBQ;:numProcedures() const 
{ 

return NPS-1; 

} 

char const* 

AboutBQ::procedureName(int idx) const 
{ 

if(idx<0 i| idx>=NPS) 
idx = NPS-1; 
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return plut_[idx]; 

} 

i********************************************** **+***+*******+**** 

5 * FILE: bandqual.cxx 
*/ 

#include <basecall/mb.hxx> 
^include <basecall/fuzzyset.h> 
void 

10 RdrOut::bandqual() 

{ 

static double const tinyh_x[] = {0.00, 0.02, 0.03}, 

tinyh_y[]= {1.00, 1.00,0.00}; 
static double const smaih_x[] = {0.015, 0.025, 0.04, 0.05}, 
1 5 smalh_j[] = { 0.000, 1 .000, 1 .00, 0.00} ; 

static double const modrh_x[] = {0.035, 0.045, 0.14, 0,16}, 

modrh_y[] = {0.000, 1.000, 1.00,0.00}; 
static double const normh_x[] = {0.135, 0.145, 0.48, 0.55}, 

nornuVy[]= {0.000, 1.000, 1.00,0.00}; 
20 static double const tallh_x[] = {0.475, 0.485, 1.00}, 

tallh_y[] = {0.000. 1.000, 1.00}; 

# define TTNYH_SZ (sizeof(tinyh_x)/sizeof(tinyh_x[0]» 

# define SMALH_SZ (sizeofi[smalh_x)/sizeof(smalh_x[0])) 

# define MODRH_SZ (sizeof(modrh_x)/sizeof(modrh_x[0])) 
25 # define NORMH_SZ (sizeofi(normh_x)/si2eof(normh_x[0])) 

# define TALLH_SZ (sizeof(tallh_x)/sizeof(tallh_x[0])) 
static double const presx_x[] = { 1 .0, 1 .35} , 

presx_y[]={1.0, 0.0}; 
static double const neglx_x[] = {1.35, 1.5}, 
30 neglx_y[]= {0.00, 1.0}; 

# define PRESX_SZ (sizeof(presx_x)/sizeof(presx_x[0])) 
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# define NEGLX_SZ (sizeof(neglx_x)/sizeof(neglx_x[0])) 
static double const normw_x[] = {-1 .0, *0.4, 0.4, 1 .0}, 

normw_y[) = { 0.0, 1.0, 1.0, 0.0}; 

# define NORMWSZ (sizeof(normw_x)/si2eof(normwx[0])) 
5 static double const goods_x[] = { 0.4, 0.5, 1 .0}, 

goods _y[]={ 0.0, 1.0,1.0); 

# define GOODS_SZ (sizeof(goods_x)/sizeof(goods__x[0])) 
static double const goodg_x[] = {0.3, 0.5, 0.7}, 

goodg_y[] = {1.0, 0.0,1.0}; 
1 0 # define GOODG_SZ (sizeof(goodg_x)/sizeof(goodg_x[0])) 
static double const okbuz_x[] = {0.0, 0.2, 0.4}, 
okbuz_j[)={1.0, 1.0, 0.0}; 

# define GOODB_SZ (sizeof(okbuz_x)/sizeof(okbuz_x[0])) 
static double const concl_x[] = {-0.100, 0.010, 0.125 }; 

15 static double const conc2_x[] = { 0.010, 0.125, 0.250 }; 

static double const conc3jc[] = { 0. 125, 0.250, 0.375 }; 

static double const conc4_x[] = { 0.250, 0.375, 0.500 }; 

static double const conc5_x[] = { 0.375, 0.500, 0.625 }; 

static double const conc6_x[] = { 0.500, 0.625, 0.750 }; 
20 static double const conc7_x[] = { 0.625, 0.750, 0.875 }; 

static double const conc8_x[] = { 0.750, 0.875, 0.990 }; 

static double const conc9_x[] = { 0.875, 0.990, 1.105 }; 

static double const concn_y[] = {0.0, 1.0, 0.0}; 

# define CONCL^SZ (sizeof(concl_x)/sizeof(concl_x[0]» 

25 static double const concjcQ = {0.0, 1 .0}, conc_y [] = {0.0, 0.0} ; 

# define CONCJSZ (sizeof(conc_x)/sizeof(conc_x[0])) 
CFuzzySet *tinyH, *smalH, *modrH, *noimH, *tallH; 
CFuzzySet *presX. *neglX, *normW, *goodS, *goodG, *goodB; 
int NB = bandStats_.Ien(), idx; 

30 tinyH = ConstructCFuzzySet( TINYH_SZ, tinyh_x, tinyhj, NOHEDGE ); 

smalH = ConstructCFuzzySet( SMALH_SZ, smalh_x, smalh_y, NOHEDGE ); 
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modrH = ConstructCFuzzySet( MODRH_SZ, modrhx, modrh_y, NOHEDGE ), 
nonnH = ConstructCFuzzySet( NORMH_SZ. normh_x, normh_y, NOHEDGE ), 
tallH = ConstmctCFuzzySet( TALLH SZ, tallh_x, tallh_y, NOHEDGE ); 
presX = ConstructCFuzzySet( PRESX_SZ, presx_x, presx_y, NOHEDGE ); 
5 neglX = ConstructCFuzzySet( NEGLX_SZ, neglx_x, neglx_y, NOHEDGE ); 

normW = ConstructCFuzzySet( NORMW_SZ, normw_x, normw_y, NOHEDGE ); 
goodS = ConstructCFuzzySet( GOODS_SZ, goods_x, goods_y, NOHEDGE ); 
goodG = ConstructCFuzzySet( GOODG_SZ, goodg_x, goodg_y, NOHEDGE ); 
goodB = ConstructCFuzzySet( GOODB_SZ, okbuz_x, okbuz_y, NOHEDGE ); 
10 for(idx=0;idx<NB:idx-H-) { 

double tinyh, smalh, modrh, normh, tallh, okbuz, runchl, runfil, fourok; 

double presx. neglx, normw, goods, goodgl,goodg2,okgp, conj, okhght; 

CFuzzySet *CONCLUSION, *C1, *C2, *C3, *C4, *C5, *C6, *C7, *C8, *C9; 

CONCLUSION = ConstructCFuzzySet( CONC_SZ, concx, conc_y, NOHEDGE ); 
15 C 1 = ConstructCFuzzySet( CONCL_SZ, concl _x, concn_y, NOHEDGE ); 

C2 = ConstructCFuzzySet( CONCL_SZ, conc2_x, concn_y ( NOHEDGE ); 

C3 = ConstructCFuzzySet( CONCL_SZ, conc3_x, concn_y, NOHEDGE ); 

C4 = ConstructCFuzzySet( CONCL_SZ, conc4_x, concn_y, NOHEDGE ); 

C5 = ConstructCFuzzySet( CONCL_SZ, conc5_x, concn_y, NOHEDGE ); 
20 C6 = ConstructCFuzzySet( CONCL_SZ, conc6_x, concn_y, NOHEDGE ); 

C7 = ConstructCFuzzySet( CONCL_SZ, conc7 x, concn_y, NOHEDGE ); 

C8 - ConstructCFuzzySet( CONCL_SZ, conc8_x, concn_y, NOHEDGE ); 

C9 = ConstructCFuzzySet( CONCL_SZ, conc9_x, concn_y, NOHEDGE ); 

tinyh = tinyH->membership( tinyH, bandStats .hght(idx) ); 
25 smalh = smalH->membership( smalH, bandStats_.hght(idx) ); 

modrh = modrH->membership( modrH, bandStats .hght(idx) ); 

normh = normH->membership( normH, bandStats .hght(idx) ); 

tallh = tallH->membership( tallH, bandStats_.hght(idx) ); 

presx = presX->membership( presX, bandStats_.xbnd(idx) ); 
30 neglx = neglX->membership( neglX, bandStats_.xbnd(idx) ); 

normw = normW->membership( normW. bandStats.widt(idx) ); 
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goods = goodS->membership( goodS, bandStats_.shap(idx) ); 

okbuz = goodB->membership( goodB, bandStats_.buzz(idx) ); 

goodgl = goodG->membership( goodG, bandStats_.sgap(idx) ); 

goodg2 = goodG->membership( goodG, bandStats_.lgap(idx) ); 
5 okgp = AND(goodgl,goodg2); 

Cl->scale( CI, AND( tinyh, NOT(okgp)) ); 

C2->scale( C2, AND( smalh, AND(presx,NOT(okgp))) ); 

C3->scale( C3, AND( tinyh, okgp) ); 

okhght = OR(modrh,OR(normh,tallh)); 
1 0 runch 1 = AND(NOT(goods),okhght); 

runfil = AND(runchl.AND(neglx,AND(nonnw,AND(okbuz.okgp)))); 

fourok = AND(okbuz,AND(normw,AND(goods,okgp))); 

double cjl,cj2,cj3,cj4,cj5,cj6, _lbad, _2bad, _3bad, _4bad; 

cjl = AND(okbuz.AND(normw,AND(goods,NOT(okgp)))); 
15 cj2 = AND(okbuz,AND(normw,AND(NOT(goods),okgp))); 

cj3 = AND(okbuz,AND(NOT(normw),AND(goods,okgp))); 

cj4 = AND(NOT(okbuz),AND(normw,AND(goods,okgp))); 

_1 bad = OR(cj 1 ,OR(cj2,OR(cj3,cj4))); 

cjl = AND(okbuz.AND(normw,AND(NOT(goods),NOT(okgp)))); 
20 cj2 = AND(okbuz,AND(goods,AND(NOT(normw),NOT(okgp)))); 

cj3 = AND(okbuz,AND(okgp,AND(NOT(normw),NOT(goods)))); 

cj4 = AND(normw,AND(goods^\ND(NOTCokbuz),NOT(okgp)))); 

cj5 = AND(normw,AND(okgp,AND(NOT(okbuz),NOT(goods)))); 

cj6 = AND(goods,AND(okgp,AND(NOT(okbuz),NOT(normw)))); 
25 _2bad = OR(cj 1 ,OR(cj2,OR(cj3,OR(cj4,OR(cj5,cj6))))); 

cjl = AND(okbuz,AND(NOT(normw),AND(NOT(goods),NOT(okgp)))); 

cj2 = AND(normw,AND(NOT(okbuz),AND(NOT(goods),NOT(okgp)))); 

cj3 = AND(goods,AND(NOT(okbuz),AND(NOT(normw),NOT(okgp)))); 

cj4 = AND(okgp,AND(NOT(okbuz),AND(NOT(normw),NOT(goods))»; 
30 _3bad = OR(cj 1 ,OR(cj2,OR(cj3,cj4))); 

_4bad = AND(NOT(okbuz),AND(NOT(nortnw»ND(NOT(goods).NOT(okgp)))); 
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C4->scale(C4,AND(OR(smalh.modrh),OR(presx, OR(_3bad._4bad)))); 

C5->scale(C5,AND(OR(smalh,modrh),OR(NOT(neglx),OR(_2bad,_3bad)))); 

C6->scale(C6,AND(okhght,OR( OR(NOT(okbuz),NOT(neglx)), _2bad ))); 

conj = AND(okhght,AND(okbuz,AND(neglx,_2bad))); 
5 C7->scaie(C7,OR(conj,OR(runfil,AND(smalh,AND(NOT(presx),fourok))))); 

C8->scale(C8,AND(okbuz,AND(okhght,AND(neglx,_lbad)))); 

C9->scale(C9,AND(okhght,AND(neglx,fourok))); 

CONCLUSION->cj( CONCLUSION, CI, DISJ ); 

CONCLUSION->cj( CONCLUSION, C2, DISJ ); 
10 CONCLUSION->cj( CONCLUSION, C3, DISJ ); 

CONCLUSION->cj( CONCLUSION, C4, DISJ ); 

CONCLUSION->cj( CONCLUSION, C5, DISJ ); 

CONCLUSION->cj( CONCLUSION, C6, DISJ ); 

CONCLUSION->cj( CONCLUSION, C7, DISJ ); 
1 5 CONCLUSION->cj( CONCLUSION, C8, DISJ ); 

CONCLUSION->cj( CONCLUSION, C9, DISJ ); 

double concl; 

(void)CONCLUSION->fcentroid( CONCLUSION, &concl ); 

bandStats_.qual(idx,float(concl)); 
20 DestructCFuzzySet(CONCLUSION); 

DestructCFuzzyScl(Cl ); 

DestructCFuzzySet(C2); 

DestructCFuzzySet(C3); 

DestructCFuzzySet(C4); 
25 DestructCFuzzySet(C5); 

DestructCFuzzySet(C6); 

DestructCFuzzySet(C7); 

DestructCFuzzySet(C8); 

DestructCFuzzySet(C9); 

30 } 

DestructCFuzzySet(tinyH); 
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DestructCFuzzySet(smalH); 
DestructCFuzzySet(modrH); 
DestructCFu2zySet(normH); 
DestructCFuzzySet(tallH); 
5 DestructCFuzzySet(presX); 
DestructCFuzzySet(neglX); 
DestmctCFuzzySet(normW); 
DestructCFuzzySet(goodS); 
DestructCFuzzySet(goodG); 
1 G DestructCFuzzySet(goodB); 
} 

f****^***************** ****** ************************************* 

* FILE: blindeconv.cxx 
15 * AUTHOR: Andy Marks 
*/ 

#include <basecall/mb.hxx> 
#include <nrc/Complex.hxx> 
ttfdefined(DEBUG) 
20 static void 

dzp( double const* p. int n, char const* fname ) 

{ 

FILE* fp = xfopentfname/W'); 
if(NULL != fp) { 
25 Complex const* z = (Complex const*)&p[-l ]; 
for(int idx=l; idx<=n; idx++) 
::fprintf(rp > "%l 1.6F %1 1.6An' , ,z[idx].real(),zfidx].imag()); 
::fclose(fp); 
} 

30 } 

static void 
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drp( double const* p, int n, char const* fname ) 
{ 

FILE* fp= ::fopen(fname,"w"); 
if(NULL != fp) { 
5 for(int idx= 1 ; idx<=n; idx++) 
: :fprintf(fp,"% 1 1 .6 An" ,p[idx]); 
::fclose(fp); 
} 

} 

10 flendif 
static void 

fftshift( double* v, int n ) 
{ 

int idx. mid = n/2; 
1 5 for(idx=l ; idx<=mid; idx-H-) { 
double t = v[idx]; 
v[idxj = v[mid+idx]; 
v[mid+idxj = t; 
} 

20 } 

static void 

dCadd( double* cl , double const* c2, int n ) 

{ 

Complex* zl = (Complex*)&cl[-l]; 
25 Complex const* z2 = (Complex const*)&c2[-l]; 
for(int idx=l; idx<=n; idx++) 
zl[idx] +=z2[idx]; 

} 

static void 
30 dCln( double* pc, int n ) 

{ 
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Complex* z = (Complex*)&pc[-l]; 
for(int idx=l ; idx<=n; idx++) 
z[idx]Jn(); 

} 

5 static void 

dCexp( double* pc, int n ) 

{ 

Complex* z = (Complex* )&pc[-l]; 
for(int idx=l ; idx<=n; idx++) 
10 z[idx].exp(); 
} 

static void 

dRCmul( double* pc, double const* pr, int n ) 

{ 

1 5 Complex* z = (Complex*)&pc[-l ]; 
for(int idx=l ; idx<=n; idx++) 
z[idx].cmul(pr[idx]); 

} 

static void 

20 dRkCmul( double* pc, double Rk, int n ) 

{ 

Complex* z = (Complex*)&pc[-l]; 
for(int idx=l ; idx<=n; idx-H-) 
z[idx].cmul( Rk ); 

25 } 

enum RI { DREAL, DIMAG }; 
static void 

dGetComponent( double const* pin, double* pout, Rl ri. int n ) 
{ 

30 int idx, kdx; 

if(DREAL == ri) { idx = 1; kdx = 1 ; } 
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else { idx = 2;kdx = -l; } 
for( ; idx<=2*n; idx+=2) { 
pout[idx+kdx] = 0.0; 
pout[idx] = pin(idx]; 
5 } 
} 

static void 

dCNozeros( double* ivec, int n ) 

{ 

10 Complex* z = (Complex*)&ivec[-l]; 
for(int idx=l; idx<=n; idx++) 
if(z[idx] = 0.0) 
z[idx].real( DBLJEPSILON ); 

} 

15 void 

SegRead::blindeconv( Wvfm& wv, int FBW ) 
{ 

#ifdefined(DEBUG) 

static char call = 0; 
20 #endif 

int lane, jdx, sdx; 
#ifdefined(DEBUG) 

call++; 
#endif 

25 if(FBW != pmb_->lastFbw J { 
double fsigma, alpha, c; 
pmb_->lastFbw_ = FBW; 

fsigma = double((FBW-l)*2)*NR_PI/double(NPTS); 
alpha = 0.5* fsigma* fsigma; 
30 c = ::sqrt(NR_PI/alpha); 

for(sdx=l ; sdx<=NPTS; sdx++) 
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pmb_->filter_[sdx) = c * ::exp( -pmb_->ww_[sdx] / (4*alpha) ); 
::fftshift( pmb_->filter_, NPTS ); 

} 

for(lane=l; lane<=4; lane++) { 
5 for(jdx=sdx=l ; sdx<=NPTS; jdx+=2, sdx+=l ) { 
pmb_->ivec_[jdx] = wv.sc_la( sdx, lane ); 
pmb_->ivecjjdx+l] = 0.0; 
} 

::fftshift( pmb_->ivee_, 2*NPTS ); 
1 0 ::dfourl ( pmb_->ivec_, NPTS, 1 ); 

::dCNozeros( pmb_->ivec_, NPTS ); 
::dCln( pmb_->ivec_, NPTS ); 

: : dGetComponent(pmb_->i vec_,prab_->imag JDIM AG,NPTS) ; 
::dGetComponent( pmb_->ivec_, pmb_->ivec_, DREAL, NPTS ); 
1 5 ::dfourl ( pmb_->ivec_, NPTS, -1 ); 

::dRkCmul( pmb_->ivec_, 1 .0/double(NPTS), NPTS ); 
#ifdefined(DEBUG) 

if(l==call)drp(pmb_->lifter_,NPTS, M lifter"); 

if( 1 ==call) dzpOJmb_->ivec_,NPTS,"cepstrum"); 
20 tfendif 

::dRCmul( pmb_->ivec_, pmb_->lifter_, NPTS ); 

::dfourl( pmb_->ivec_, NPTS, 1 ); 

::dGetComponent(pmb_->ivec_,pmb_->ivec_,DREAL,NPTS ); 

::dCadd( pmb_->ivec_, pmb_->imag_, NPTS ); 
25 ::dCexp( pmb_->ivec_, NPTS ); 

::dRCmul( pmb_->ivec_, pmb_->filter_, NPTS ); 

::dfourl( pmb_->ivec_, NPTS, -1 ); 

::dRkCmul( pmb_->ivec_, 1 .0/double(NPTS), NPTS ); 

::fftshift( prnb_->ivec_, 2*NPTS ); 
30 if(l=pmb_->remune()) 

for(sdx=jdx=l; sdx <=NPTS; sdx+=l, jdx+=2) 
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wv.sc_la_set(sdx.lane, pmb_->ivec_[jdx]); 

} 

} 

5 /*♦**♦♦********♦**********♦*************»**************************** 

* FILE: Centroid.cxx 
♦AUTHOR: Andy Marks 
*/ 

#include <nrc/Centroid.hxx> 
10 int 

icentroid( int const* px. int const* py, int n ) 
{ 

int rv; 

if{l=n) 
15 rv = px[l]; 
else if(2==n) 
rv = (px[l]+px[2]+l)/2; 
else { 

long numer=0L, denom=0L; 
20 for(int i=2; i<=n; i++) { 

long t = py[i-l]*(2*px[i-l] + px[i]) + py[i]*(px[i-l]+2*px[i]); 

numer = numer + t*(px[i]-px[i-l ]); 

denom = denom + (px[i] - px[i-l])*(py[i-l]+py[i]); 

} 

25 rv = int(0.5f + float(numer)/float(3L*denom)); 
} 

return rv; 

} 

float 

30 fcentroid( float const* px, float const* py, int n ) 
{ 
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float rv; 
if(l=n) 
rv = px[l]; 
else if(2=n) 
5 rv = (px[l]+px[2])/2.0f; 
else { 

double numer=0.0, denom=0.0; 
for(int i=2; i<=n; i++) { 

double t = py[i-l]*(2.0*px[i-l] + px[i]) + py[i]*(px[i-l]+2.0*px[i]); 
10 numer = numer + l*(px[i]-px[i-l]); 

denom = denotn + (px[i] - px[i-l])*(py[i-l]+py[i]); 
} 

rv = float(numer/3.0*denom); 

} 

15 return rv; 

} 

* FILE: Centroid.hxx 
20 * AUTHOR: Andy Marks 

*/ 

#if !definedLCENTROID_HXX_) 

# define _CENTROID_HXX_ 
^include <nrc/nrutil.hxx> 

25 DLLexport int icentroid(int const* px, int const* py, int n); 

DLLexport float centroid(float const* px, float const* py, int n); 
#endif 

30 * FILE: Complex.cxx 
♦AUTHOR: Andy Marks 
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* COPYRIGHT (c) 1996, University of Utah 
*/ 

#include <nrc/Complex.hxx> 
Complex const& 
5 Complex::operator=(Complex const& b) 

{ 

if(this != &b) { 
r_ = b.r_; 
i_ = b.i_; 
10 } 

return *this; 

} 

Complex 

Complex: :operator+(Complex const& b) const 
15 { 

Complex c; 
c.r_ = r_ + b.r_; 
c.i_ = i_ + b.i_; 
return c; 

20 } 

Complex const& 

Complex: :operatori-=(Complex const& b) 

{ 

*this=*this + b; 
25 return *this; 

} 

Complex 

Complex::operator*(Complex const& b) const 
{ 

30 Complex c; 

c.r_ = r_*b.r_- i_*b.i_; 
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c-L ~ L*b- r _ + r _*b.i_; 
return c; 

} 

Complex const& 
5 Complex: :operator*=(Compl ex const& b) 

{ 

♦this = *this * b; 
return *this; 

} 

10 Complex 

Complex: :operator-(Complex const& b) const 
{ 

Complex c; 
c.r_ = r_ - b.r_; 
15 c.i_ = i_ - b.i_; 
return c; 

} 

Complex const& 

Complex: :operator-=(Complex const& b) 
20 { 

♦this = ♦this - b; 
return + this; 

} 

Complex 

25 Complex: :operator/(Complex const& b) const 
{ 

Complex c; 
double R, D; 

if(::fabs(b.rj >= ::fabs(b.i J) { 
30 iflfl.O = b.rj return c; 
R =b.i_/b.r_; 
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D = b.r_ + R*b.i_; 
if(0.0 = D) return c; 
c.r_ = (r_ + R*iJ/D; 
c.i_ « (i_ - R*r_)/D; 

5 } 
else { 

if(0.0 = b.i_) return c; 
R =b.r_/b.i_; 
D = b.i_ + R*b.r_; 
10 if(0.0 = D) return c; 
c.r_ = (R*r_ + iJ/D; 
c.i_ = (R*i_ - rJ/D; 
} 

return c; 

15 } 

Complex const& 

Complex: :operator/=(Complex const& b) 
{ 

♦this = *this / b; 
20 return *this; 
} 

void 

Complex::exp() 

{ 

25 double ez = ::exp( r_ ); 
r_ = ez * ::cos( i_ ); 
i_ = ez * ::sin( i_); 

} 

void 

30 Complex::ln() 
{ 
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double ang = angle(); 

r_ = 0.5 * ::log( r_*r_ + i_*i_ ); 

i_ = ang; 

} 

void 

Complex::conj() 

{ 

i_ = -i_; 

} 

void 

Complex::cmul( double k ) 

{ 

r_*=k; 
i_ *- k; 

} 

void 

dCMul( double const* a, double const* b, double* c, int n) 

{ 

Complex const *za = (Complex const*)&a[-I], 

*zb = (Complex const*)&b[-l]; 
Complex *zc = (Complex*)&c[- 1 ]; 
for(int idx=l ;idx<=n;idx++) 
zc[idx] = za[idx] * zbpdx]; 

} 

* FILE: Complex.hxx 

* AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

#ifndef_COMPLEX_HXX_ 
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#defme _COMPLEX JIXX_ 
^include <math.h> 
class Complex 
{ 

5 public: 

Complex( double real = 0.0, double imag = 0.0 ) : rjreal), ijimag) {}; 

Complex( double const nr[2] ) : r_(nr[0]), i_(nr[l]) {}; 

CompJex( Complex const& rhs ) : rJO.O), i_(0,0) { *this = rhs; ); 

-Complex() {}; 
10 Complex const& operator=(Complex const& b); 

Complex operator+(Complex const& b) const; 

Complex operator*(Complex const& b) const; 

Complex operator-(Complex const& b) const; 

Complex operator/(Complex const& b) const; 
1 5 int operator=(Complex const& b) const {retum(r_=b.r_ && i_==b.i_);> 

Complex const& operator+=(Complex const& b); 

Complex const& operator*=(Complex const& b); 

Complex const& operator— (Complex const& b); 

Complex const& operator/=(Complex const& b); 
20 void real(double r) { r_ = r; } 

void imag(double i) { i_ = i; } 

void exp(); 

void ln(); 

void conj(); 
25 void cmul(double k); 

double real() const { return r j } 

double imag() const { return i_; } 

double abs() const { return ::sqrt(r_*r_ + i_*ij; } 

double angle() const { return ::atan2(i_,r_); } 

30 private: 

double r_, 
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L; 

>; 

void dCMul( double const* a, double const* b, double* c, int n); 
#endif 

/mi**************************** ************+**************************** 

♦FILE: dfourl.cxx 
♦TYPIST: Andy Marks 
*/ 

#include <nrc/nr.hxx> 

#define SWAP(a,b) { tempr=(a); (a)=(b); (b)=terapr; } 

#if !defined(SA) 

void 

dfourl( double data[], unsigned long nn, int isign ) 

{ 

unsigned long n, nunax, m, j, istep, i; 
double wtemp, wr,wpr,wpi,wi,theta; 
double tempr, tempi; 
n = nn«l ; 

j=i; 

for(i=l;i<n;i+=2) { 

if(j>0 { 
SWAP(data[j],data[i]); 
SWAP(data[j+l],data[i+l]); 
} 

m = n»l; 

while(m>=2&&j>m) { 
j-=m; 
m»= I; 

} 

j+=m; 
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} 

mmax = 2; 
while(n>mmax) { 

istep = mmax«l; 

theta = isign*(2.0*NR_PI/mmax); 

wtemp = sin(0.5*theta); 

wpr = -2.0*wtemp*wtemp; 

wpi = sin(theta); 

wr= 1.0; 

wi = 0.0; 

for(m=l;m<mmax;m+=2) { 
for(i=m;i<=n;i+= : istep) { 
j=i+mmax; 

tempr = wr*data[j] - wi*data[j+l ]; 
tempi = wr*data[j+1]+wi*data[j]; 
data[j] = data[i]-tempr; 
data[j+l] =data[i+l]-tempi; 
data[i] += tempr; 
data[i+l ] += tempi; 
} 

wr = (wtemp=wr)*wpr-wi*wpi+wr; 
wi = wi*wpr + wtemp* wpi + wi; 

} 

mmax = istep; 

} 

} 

#endif 

#ifdefmed(SA) 
#include <stdio.h> 
void 
mainQ 
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{ 

double *unitstep; 
size_t idx, jdx; 

if(NULL = (unitstep = dvector( 1,16))) 

nrerror("main: failed to allocate unitstep vector\n"); 
unitstepfl] = 1.0; unitstep[2] =0.0; 
unitstep[ 3] = 1 .0; unitstepf 4] = 0.0; 
unitstep[5] = 0.0; unitstep [ 6] =0.0; 
unitstep[7] = 0.0; unitstepf 8] =0.0; 
unitstepf 9] = 0.5; unitstepfl 0] =0.0; 
unitstepfl 1] = 0.5; unitstepf 12] =0.0; 
unitstep[13] = 0.0; unitstepf 14] =0.0; 
unitstep[15] = 0.0; unitstepf 16] =0.0; 
::printf("INPUT:\n"); 
for(idx = 1; idx<=16; idx++) 

printf("\tv[%21d]=% 1 1 .6 An n ,idx,unitstep[idx]); 
dfourl( unitstep, 8, 1 ); 
::printf("FFT(INPUT):\n"); 
for(idx = 1; idx<=16; idx++) 

printf("\tvf%21d]=%l 1 .6f\n",idx,unitstep[idx]); 
dfourl( unitstep, 8, -1 ); 
::printfriFFT(INPUT):\n"); 
for(idx = 1; idx<=16; idx++) 

printf( ,, \tv[%21d]=%l 1 .6f\n M ,idx,unitstep[idx]); 
free_dvector(unitstep, 1 ,1 6); 

} 

fendif 

* FILE: fgapcheck.c 

* COPYRIGHT (c) 1996, University of Utah 
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*/ 

^include <basecall/fiizzyset.h> 
#include <basecall/fgapcheck.h> 
#ifdefined(SA) 
5 # define VERBOSE 
#else 

# undef VERBOSE 
flendif 
static double 
1 0 weight(size_t n, size_t gs, size_ t cs) 
{ 

return 1.0-sqrt((double)((n-gs)*(n-gs) + (n-cs)*(n-cs))/(2.0*(double)(n*n))); 

} 

static double 

1 5 gcness(size_t n, sizet gs, sizej cs, size_t mx, size_t my) 

{ 

size_t lh, sh; 
double normalize, rv; 
lh = (n+l)/2; sh = n-lh; 
20 if(0=mx) mx = lh; if(0=my) my = sh; 
normalize = weight(n,mx,my); 
rv = weightCn^s^synormalize; 
if(rv>1.0)rv-1.0; 
return rv*rv; 

25 } 
int 

gapcheck( 
int const* pis, 
int const* piw 
30 int const* ps, 
int const* pw, 
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char const* pseq, 
sizet nbands, 
float** output 

) 

5 { 

# define NPREV 5 

static double const bigwidXfJ = {0.0, 1.0}; 
static double const bigwidYf] = {0.0, 1.0); 

# define BGWDSZ (sizeof(bigwidY)/sizeofi(bigwidY[0])) 
1 0 static double const biggapXfl = {0.42,0.6, 1.0}; 

static double const biggapYf] = {0.0,1.0,1.0}; 
U define BGGPSZ (sizeof(biggapY)/sizeof(biggapY[0])) 
static double const mdgpX[] = { 0.25, 0.3, 0.45, 0.57 }; 
static double const mdgpY[] = { 0.0, 1.0, 1.0, 0.0 }; 
1 5 # define MDGPSZ (sizeof(mdgp Y )/sizeof(mdgpY[0])) 
static double const smgpX[] = {-1 .0, -0.5, 0.0}; 
static double const smgpY[] = {1.0, 1 .0, 0.0} ; 

# define SMGPSZ(sizeof(smgpY)/sizeof(smgpY[0])) 

# define A 0.5079 
20 # define B 1.5002 

# define C 1.5069 

# define D 2.5063 

static double const norm_x[] = { A, B, C }; 
static double const norm_y[] = { 1.0, 1.0, 0.0 }; 
25 # define NORMSZ (sizeof(norm_x)/sizeof(nonn_x[0])) 
static double const split_x[] = { B, C, D }; 
static double const split_y[] = { 0.0, 1 .0, 1.0 }; 

# define SPLITSZ (sizeof{split_x)/sizeof(split_x[0])) 
static double const concl_x[] = { A, D } ; 

30 static double const concl_y[] = { 0.0, 0.0 } ; 

# define CONCLSZ (sizeof(concl_x)/sizeof(concl_x[0])> 
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*cs; 
double *gc; 

CFuzzySet *ciBgGP, *ciMdGP, *ciSmGP, *ciBgWD; 
5 size_t idx; 
int rv = I ; 

gc = (double*)malloc(sizeof(*gc)*(nbands+l)); 
gs = (size_t*)malloc(sizeof(*gs)*(nbands+l)); 
cs = (size_t*)malloc(sizeof(*cs)*(nbands+l)); 
1 0 if(NULL==gs || NULL=cs || NULL=gc) { 

fprintf(stderr, n out of memory in fgapchcck.c, line %d\n*\_LINE_); 

if(NULL != gs) { free(gs); gs = NULL; } 

if(NULL N cs) { free(cs); cs = NULL; } 

if(NULL != gc) { free(gc); gc = NULL; } 
15 return 0; 

} 

ciBgWD = ConstructCFuzzySet( BGWDSZ, bigwidX, bigwidY, CONTINT ), 
ciBgGP = ConstructCFuzzySet( BGGPSZ, biggapX, biggapY, VERY ); 
ciMdGP = ConstructCFuzzySet( MDGPSZ, mdgpX, mdgpY, NOHEDGE ); 
20 ciSmGP = ConstructCFuzzySet( SMGPSZ ? smgpX, smgpY, CONTINT ); 
for(idx=l; idx<=nbands; idx-f+) { 
sizejbgnj, jdx; 
double expgaps, prvgaps = 0.0; 
bgnj = (idx>NPREV)?(idx-NPREV):l; 
25 gs[idx] = cs[idx] = 0; 

for(jdx = bgnj; jdx < idx; jdx++) { 
prvgaps += (double)ps(jdx]; 
switch( pseqjjdx] ) { 
case r G': gs[idx] 4= 1; break; 
30 case 'C: cs[idx] += 1 ; break; 

default; break; 
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} 

} 

expgaps = (double)(piw[idx]*(idx-bgnj)); 
if((0.0=expgaps) || (prvgaps<expgaps)) 
5 prvgaps = expgaps = 1 .0; 

gcpdx] = (expgaps/prvgaps)*gcness(NPREV,gs[idx],cs[idx],NPREV/2,NPREV/2); 
} 

for(idx=l; idx<=nbands; idx++) { 
CFuzzySet 'CONCLUSION, *R_SPLIT, *R_NORM; 
10 # define Nl 

double rawgap[N+l], rawwid[N+l], biggapfN+1], smlgap[N+l], bigwid[N+l], gcrich; 
double conjl, conj2, cmpti, concl; 
double medgap[N+l], fatso; 

R_NORM =ConstructCFuzzySet(NORMSZ, norm_x, norm_y, NOHEDGE); 
1 5 R_SPLIT = ConstructCFuzzySet( SPLITSZ, split_x, split_y, NOHEDGE ); 

CONCLUSION = ConstructCFuzzySet( CONCLSZ, concl_x, concl_y, NOHEDGE ); 
if(0=pisfidx] || 0=piw[idx]) { 

DestructCFuzzySet(R_SPLIT); 

DestructCFuzzySet(R_NORM); 
20 DestructCFuzzySet(CONCLUSION); 

rv = 0; 

break; 

} 

rawgapfN] = ((double)ps[idx]/(double)pis[idx]) - 1 .0; 
25 rawgap[N- 1 ] = ( 1 =idx)? 0.0: (((double)ps[idx- 1 ]/(double)pis[idx]) - 1 .0); 

rawwid[N] = ((double)pw[idx)/(double)piw[idx]) - 1 .0; 

rawwid[N-I] = (l=idx)? 0.0: (((double)pw[idx-l]/(double)piw[idx]) - 1.0); 

biggap(N] = ciBgGP->membership( ciBgGP, rawgapfN] ); 

medgap[N] = ciMdGP->membership( ciMdGP, rawgapfN] ); 
30 smlgapfN] = ciSmGP->membership( ciSmGP, rawgapfN] ); 

biggap[N-l] = ciBgGP->membership( ciBgGP. rawgapfN- 1] ); 
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medgap[N-l] = ciMdGP->membership( ciMdGP, rawgap[N-l] ); 

smlgap[N-l] = ciStnGP->membership( ciSmGP, rawgap[N-l] ); 

bigwid[N] = ciBgWD->membership( ciBgWD, rawwid[N] ); 

bigwidfN-1] = ciBgWD->membership( ciBgWD, rawwid[N-l] ); 
5 gcrich = gc[idx]; 

fatso = AND(bigwid[N-l J,AND(medgap[N-l]^nedgap[N])); 
#ifdefined(VERBOSE) 

printf(" pseq[%3d]=%c pseq[%3d]=%c\n",idx- 1 ,pseqlidx- 1 ],idx,pseq[idx]); 

printf(" ps[%3d]=%2d ps[%3d]=%d\n",idx-l,ps[idx-l], idx,ps[idx]); 
10 printf(" pis[%3dl=%2d pis[%3d]=%d\n".idx-l,pisfidx-l], idx,pis[idx]); 

printf( M pw[%3d]=%2d pw[%3d]=%d\n",idx-l,pwfidx-l], idx,pw[idx]); 

printf(" piw[%3d]=%2d piw[%3d]=%d\n"adx-l ,piw[idx- 1 ]. idx,piw[idxl); 

printf(''rawgap[%3d]=%52frawgap[%3d]=%5.2f\n^idx-l,rawgap[N-l]Jdx,rawgap|N 

printf("rawwid[ 0 /g3d]=°/o5.2f rawwid[%3d]=%5.2f\n",idx- 1 ,rawwid[N- 1 ],idx,rawwid[N]); 
1 5 printf("biggap[%3d]=%5 .2f biggap[%3d]=%5.2f\n",idx- 1 ,biggap[N- 1 ],idx,biggap[N]); 

p^intf( ,, medgap[%3d]=%5.2fmedgap[%3d]=%5.2f\n^idx-l,medgap[N-l],idx,medgapiN]); 

p^intf("smlgap[%3d]=%52fsmlgap[%3d]=%5.2f\n^idx-l,smlgap[N-l],idx ) smlgap[N]); 

printf(*^igwid[%3d]=%52f bigwid^ 

printf("gcrich = %4.2f\n",gcrich); 
20 printf("fatso = °/o4.2f\n", fatso); 
#endif 

conj 1 = AND(biggap[N],gcrich); 

conj2 = AND(biggap[N],smlgaprN-l]); 

conj2 = AND(conj2,NOT(bigwid[N]) ); 
25 conj2 = AND(conj2,NOT(bigwid(N-l ]) ); 

R_NORM->scale( R_NORM, OR(NOT(biggap[N]),OR(conjl,conj2)) ); 

conjl = AND(biggap[N],OR(bigwid[N-l],bigwid[N])); 

conj2 = AND(biggap[N],AND(NOT(smlgap[N-l]),NOT(gcrich))); 

R_SPLIT->scale( R_SPLIT, OR(conj 1 ,conj2)); 
30 CONCLUSION->cj( CONCLUSION, R_NORM, DISJ ); 

CONCLUSION->cj( CONCLUSION, R_SPLIT, DISJ ); 
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(void)CONCLUSION->fcentroid( CONCLUSION, &concl ); 
cmpti = CONCLUSION->compatIndex( CONCLUSION ); 
output[idx][l] = (float)concl; 
output[idx][2] = (float )cmpti; 
5 #ifdefined(VERBOSE) 

R_NORM->print( R NORM, "R_NORM" ); 
R_SPLIT->print( R_SPLIT, "R_SPLIT" ); 
CONCLUSION->print( CONCLUSION, "CONCLUSION" ); 
printf("idx=%d concl=%f cmpti=%f\n\n".idx,concl,cmpti); 
10 #endif 

DestructCFuzzySet(R_SPLIT); 

DestructCFuzzySet(R_NORM); 

DestructCFuzzySet(CONCLUSION); 

} 

15 DestructCFuzzySet(ciBgWD); 

DestructCFuzzySet(ciBgGP); 

DestructCFuzzySet(ciSmGP); 

DestructCFuzzySet(ciMdGP); 

free(gc); 
20 free(gs); free(cs); 

return rv; 

} 

#ifdefmed(SA) 
^include <stdio.h> 
25 #include <nrc/nrutil.hxx> 
int 

main(int argc, char* argvQ) 
{ 

static int sp[6], wd[6], isp[6], iwd[6], idx; 
30 char *p, *phd; 
if(2!=argc) { 
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return 0; 



fprintf(stderr,"%s: missing input field(s): [%s]\n",argv[0],phd); 
return 1; 

} 

#endif 



* FILE: fomitokn.c 

* COMP: c89 -DSA omitnok.c fuzzyset.o -o omitokn -lm 

* COPYRIGHT (c) 1996, University of Utah 



#include <float.h> 

#include <basecall/fuzzyset.h> 

int 

omitokn( 

int const* pinS, float const* pht, float const* plo, int const* pLsp, 
int const* pRsp, float const* pxb, int NPK, float** output ) 



*/ 



static double const okSpX[] 
static double const okSpY[] 
#ifdefined(AS_WAS) 



{0.2, 0.5, 0.8}; 
{1.0, OA 1.0); 



static double const abSpX[] 
#else 



{0.3,0.4, 0.6, 0.7}; 



static double const abSpX[] 



{0.2, 0.4, 0.6, 0.8}; 



#endif 
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static double const abSpY[] = {0.0, 1.0, 1.0, 0.0}; 

# define OKSPSZ (sizeof(okSpY)/sizeof(okSpY[0])) 

# define ABSPSZ (sizeof(abSpY)/sizeof(abSpY[0])) 
static double const tiXbX[] = {1.0, 1.4, 1.8}; 

5 static double const tiXbY[] = { 1 .0, 0.5, 0.0} ; 
static double const okXbXf] = { 1.2, 1.4}; 
static double const okXbY[] = {0.0, 1.0}; 

# define TlXbSZ (sizeof(tiXbX)/sizeofi(tiXbX[0])) 

# define OKXbSZ (sizeof(okXbX)/sizeof(okXbX[0])) 
10 static double tiHtX[] = {0.02.0.07} ; 

static double const tiHtY[] = { 1 .00,0.00} ; 
static double okHtX[] = {0.01.0.06}; 
static double const okHtY[] = {0.00,1.00}; 
U define TIHTSZ (sizeof(tiHtX)/sizeof(tiHtX[0])) 
1 5 # define OKHTSZ (sizeof(okHtX)/sizeof(okHtX[0])) 

static double const ok_x[] = {0.4596, 1.3797, 1.6864}; 
static double const ok_y[] = {1.0000, 1.0000, 0.0000}; 

# define OK_SZ (sizeof(ok_x)/sizeof(ok_x[0])) 

static double const n_x[] = {1.3797, 1.6864, 2.2998, 2.6065}; 
20 static double const n_y[] = {0.0000, 1.0000, 1.0000, 0.0000}; 

# define N_SZ (sizeof(n_x)/sizeof(n_x[0])) 

static double const omit_x[] = {2.2998, 2.6065, 3.5540}; 
static double const omit_y[] = {0.0000, 1.0000, 1.0000}; 
H define OMITSZ (sizeof(omit_x)/sizeof(omit_x[0])) 
25 static double const concl_x[] = {0.4596, 3.5540}; 
static double const concl_y[] = {0.0000, 0.0000}; 

# define CONCL_SZ (sizeof(concl_x)/sizeof(concl_x[0])) 
CFuzzySet *okSP. *abSP, *tiXb, *okXb, *tiHT, *okHT; 
double mean_plo = 0.0; 

30 intidx, rv=l; 

for(idx= 1 :idx<=NPK:idx++) mean_plo += plo[idx]; 
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mean_plo /= (double)NPK.; 
tiHtX[0] = 0.4*mean_plo; tiHtX[l] = l.l*mean_plo; 
okHtX[0] = 0.5*mean_plo; okHtX[l] = 1 .5*mean_plo; 
okSP = ConstructCFuzzySet( OKSPSZ, okSpX, okSpY, NOHEDGE ); 
#ifdefined(AS_WAS) 

abSP = ConstructCFuzzySet( ABSPSZ, abSpX, abSpY, NOHEDGE ); 
#else 

abSP = ConstructCFuzzySet( ABSPSZ, abSpX, abSpY, VERY ); 
#endif 

tiXb = ConstructCFuzzySet( TlXbSZ, tiXbX, tiXbY, NOHEDGE ); 
okXb = ConstructCFuzzySet( OKXbSZ, okXbX. okXbY, NOHEDGE ); 
tiHT = ConstructCFuzzySet( TIHTSZ, tiHtX, tiHtY. NOHEDGE ); 
okHT = ConstructCFuzzySet( OKHTSZ, okHtX, okHtY, SOMEWHAT ); 
for(idx=l; idx<=NPK; idx++) { 

CFuzzySet *CONCLUSION,*RULE_OK, *RULE_N, *RULE_OMIT; 
double okht,tiht, 

okLsp,okRsp,oksp, 

abLsp,abRsp,absp, 

tixb,okxb; 

double concl,cmpti. modLsp, modRsp, insp = pinSfidxj; 

CONCLUSION = ConstructCFuzzySet( CONCL_SZ, concl_x, concl_y, NOHEDGE ); 

RULE_OK = ConstructCFuzzySet( OK_SZ, ok_x, ok_y, NOHEDGE ); 

RULE N = ConstructCFuzzySet( N_SZ, n_x, n_y, NOHEDGE ); 

RULE OMIT - ConstructCFuzzySet( OMIT_SZ, omit_x, omit_y, NOHEDGE ); 

if(pLsp[idx] < (insp/2.0)) modLsp = insp/2.0; 

else modLsp = fmod( pLsp[idx], insp ); 

if(pRsp[idx] < (insp/2.0)) modRsp = insp/2.0; 

else modRsp = fmod( pRsp[idx], insp ); 

if(0.0 = insp) { 

DestructCFuzzySet(CONCLUSION); 
DestructCFuzzySet(RULE_OK); 
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DestructCFuzzySet(RULE_N); 
DestructCFuzzySet(RULE_OMIT); 
rv = 0; 
break; 

5 } 

modLsp /= insp; 
modRsp /= insp; 

okLsp = okSP->membership( okSP, modLsp ); 

okRsp = okSP->membership( okSP, modRsp ); 
1 0 abLsp = abSP->membership( abSP, modLsp ); 

abRsp = abSP->membcrship( abSP, modRsp ); 

oksp = OR(okLsp.okRsp); 

absp = OR(abLsp,abRsp); 

tixb = tiXb->membership( tiXb, pxb[idx] ); 
1 5 okxb = okXb->membership( okXb, pxb[idx] ); 

tiht - tiHT->membership( tiHT, pht[idx] ); 

okht = okHT->membership( okHT, pht[idx] ); 
#ifdefmed(AS_WAS) 

RULE_OK->scale( RULE_OK, AND(okxb,okht)); 
20 #else 

RULE_OK->scale( RULEOK, AND(okxb,OR(okht,oksp)) ); 
#endif 

RULE_N->scale( RULE_N, AND(tixb,OR(okht,AND(oksp,tiht)))); 

RULE_OMIT->scale( RULE_OMIT, AND(tiht,absp) ); 
25 CONCLUSION->cj( CONCLUSION, RULE_OK, DISJ ); 

CONCLUSION->cj( CONCLUSION, RULE_N, DISJ ); 

CONCLUSION->cj( CONCLUSION, RULEOMIT, DISJ ); 

(void)CONCLUSION->fcentroid( CONCLUSION, &concl ); 

cmpti = CONCLUSION->compatIndex( CONCLUSION ); 
30 output[idx][ 1 ] = (float)concl; 

output[idx][2] = (float )cmpti; 
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DestructCFuzzySet(CONCLUSION); 
DestructCFuzzySet(RULE_OK); 
DestructCFuzzySet(RULE_N); 
DestructCFuzzySet(RULE_OMIT); 

} 

DestnictCFuzzySet(okSP); 

DestructCFuzzySet(abSP); 

DestructCFuzzySet(okXb); 

DestructCFuzzySet(tiXb); 

DestructCFuzzySet(okHT); 

DestructCFuzzySet(tiHT); 

return rv; 

} 

#ifdefined(SA) 
void 

main(int argc, char* argv[]) 
{ 

#if 1 

# define HTINC 0.01 

# define SPINC 1.0 

# define XbINC 0.2 
#else 

# define HTINC 0.1 

# define SPINC 1.0 

# define XbINC 0.5 
#endif 

# define OKJDX 0 

# define NJDX 1 

# define OMIT_IDX 2 
double normSP= 12.0; 
double ht s sp, xb, n[4J; 

SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



58 

n[0] = n[l] = n[2] = n[3] = 0.0; 

printffSweeping [ht] from 0.0 to 0.2 in steps of %4.2f\n" > HTINC); 
printf("Sweeping [sp] from 0.0 to %4.1f in steps of %4.2f\n",normSP ? SPINC); 
printf("Sweeping [xb] from 1.0 to 2.0 in steps of %4.2f\n",XbINC); 
5 for(ht = 0.0; ht<=0.2; ht+=HTINC){ 
for(sp = 0.0; sp <= normSP; sp += SPINC) 
for(xb = 1 .0; xb <= 2.0; xb += XblNC) { 
double out[3], ci, dfz; 
char const* CALL; 
1 0 ftn_omitokn( &normSP, 1 ,&ht,&sp,&sp, &xb, out,&ci, &dfz); 

printf("\n{ht,sp.xb}:{%5.2f,%5.2f,%5.2f} -> ", ht,sp.xb); 
printf(" {OK,N,OMIT} : {%3.2f,%3.2f,%3.2f}", 
out[OK_IDX].out[N_IDX],out[OMIT_IDX]); 
if(dfz <= 0.35) { 
15 CALL = " OK";n[0]+= 1.0; 

} 

else if(dfz >= 0.65) { 
CALL = "OMIT"; n[l] += 1.0; 
} 

20 else { 

CALL = " N";n[2]+= 1.0; 

} 

n[3]+= 1.0; 

printf(" %s (dfz=%4.3f,%3.2f)",CALL,dfz,ci); 
25 } 

printf("\n"); 
} 

printf("%3.2f%% OK, %3.2f%% N, %3.2f%% OMIT in %5.0f tests\n", 
n[0]/n[3], n[2]/n[3]. n[l]/n[3], n[3]); 

30 } 

#endif 
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#ifdefined(USE_WSC) 

# include <ab/wsc.h> 
#else 

# include <malloc.h> 
#endif 

#include <basecall/fuzzyseth> 
static double 

membership( struct CFuzzySet const* pcfz, double pt ) { 
double yy; 
if(0 = pcfz->n) 

yy = 0.0; 
else if(pt <= pcfz->x[0]) 

yy = pcfz->y[0]; 
else if(pt >= pcfz->x[pcfz->n-l]) 

yy = pcfz->y[pcfz->n-l]; 
else { 

double yh, yhml; 
int lo ? mid, hi; 
lo = 0; 

hi = pcfz->n-l; 
fori;;) { 

mid = (lo+hi)/2; 
if(mid = lo) 
break; 

else if(pcfz->x[mid] < pt) 

lo = mid; 
else 

hi = mid; 

} 

yh = pcfz->y[hi]; 
yhml =pcfz->y[hi-l]; 
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yy = yhml + (pt - pcfz->x[hi-l])/(pcfz->x[hi] - pcfz->x[hi-l])*(yh - yhml); 
} 

return pcfz->d_j)hedge( yy ); 

} 

5 static double 

compatIndex( struct CFuzzySet const* pcfz ) { 
double ci - 0.0; 
int idx; 

for(idx = 0; idx < pcfz->n; idx++) 
1 0 if(pcfz->y[idx] > ci) ci = pcfz->y[idx]; 
return pcfz->d_phedge( ci ); 

} 

static int 

invalid(struct CFuzzySet const* pcfz ) { 
15 return !pcfz->n; 

} 

static void 

print(struct CFuzzySet const* pcfz, char const* pname) { 
int idx; 

20 printf( H \nCFuzzySet:[%s]\n x y'\pnarne); 
for(idx = 0; idx < pcfz->n; idx++) 
printf("\n %8.31f %8.31f',pcfz->x[idx],pcfz->d_phedge(pcfz->y[idx])); 

printft M \n M ); 

} 

25 static void 

negate( struct CFuzzySet* pcfz ) { 
int idx; 

for(idx = 0; idx < pcfz->n; idx++) 
pcfz->y[idx] = 1.0 - pcfz->y[idx]; 

30 } 

static void 
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scale( struct CFuzzySet* pcfz, double fac ) { 
int idx; 

for(idx = 0; idx < pcfe->n; idx-H-) 
pcfz->y[idx] *= fac; 

} 

static void 

cj_out( double x.double y, double* xn, double* yn, int* nn ) { 
double d; 
if((*nn > 0) && 
(fabs(x-xn[*nn-l]) < l.e-20) && (fabs(y-yn[*nn-l]) < l.e20)) 
return; 
if(*nn> 1) { 
int nml = *nn-l, 
run2 = *nn-2; 

d = xn(nml]*(y-yn[nrn2]) + yn[nml]*(xn[nm2]-x) + x*yn[nm2] - y*xn[nm2]: 
if(fabs(d)< l.e-10) 
—*nn; 

} 

xn[*nn] = x; 
yn[*nn]=y; 
++*nn; 

} 

static void 

intsec( double xl,double yU double x2,double y2, double y3,double y4, 

double *xint, double *yint) { 
double den; 

den = yl -y2-y3 + y4; 

♦xint = (x2*y 1 - xl *y2 - x2*y3 + xl*y4) / den; 

♦yint = (y 1 *y4 - y2*y3) / den; 

} 

static void 
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cj(struct CFuzzySet* pcfz,struct CFuzzySet const* s, LOGICAL_OPERATOR cjdj) { 
int nn. vertexO, vertex 1, use_fiinc_0; 
double *xn, *yn, xl, xr, yOl, yOr, yl 1, y lr, xint, yint; 
double next_xO,next_yO,next_x ! ,next_y 1 ,rightmost_x,frac; 
5 double (*hl )(double y), (*h2)(double y); 
int idx; 

hi = pcfz->d_phedge; 
h2 = s->d_phedge; 
ifiCO = pcfz->n) 
10 return; 

if(0 = s->n) { 
if(NULL!=pcfz->x){ 
#ifdefined(USE_WSC) 
FreeMemory( pcfz->x ); 
15 #else 

free( pcfz->x ); 
#endif 

pcfz->x = NULL; 
} 

20 pcfz->n = 0; 
return; 
} 

#ifdefined(USE_WSC) 
xn = NULL; 

25 (void)fNewMemory((void**)&xn,sizeof(double)*4*(pcfz->n+s->n)); 
#else 

xn = (double*)malloc(sizeof(double)*4*(pcfz->n+s->n)); 
#endif 

ifTNULL = xn) { 
30 pcfz->n = 0; 

if(NULL != pcfz->x) { 
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#ifdefmed(USE_WSC) 

FreeMemoiy( pcfz->x ); 
#else 

free( pcfz->x ); 
5 #endif 

pcfe->x = NULL; 

} 

return; 
} 

10 yn = &xn[ 2 *(pcfz->n+s->n) ]; 

cjdj = (DISJ != cjdj)? CONJ: DISJ; 
if(pcfz->x[0] < s->x[0]) { 
xl = pcfz->x[0]; 
vertexO = 1 ; 
15 vertexl = 0; 
} 

else if(pcfz->x[0] > s->x[0]) { 
xl = s->x[0]; 
vertexO = 0; 
20 vertexl = 1; 
} 

else { 

xl = pcfz->x[0]; 

vertexO = vertexl = 1; 
25 } 

y01 = hl(pcfz->y[0]); 
yll=h2(s->y[0]); 
nn = 0; 

cj_out( xl, (cjdj A (yOI>yll))?y01:yll, xn, yn, &nn ); 
30 if(pcfz->x[pcfz->n- 1 ] >= s->x[s->n- 1 ]) 
rightmostx = pcfz->x[pcfz->n-l]; 
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else 

rightmostjc = s->x[s->n- 1 ]; 
if(vertexO < pcfz->n) { 
next_xO = pcfz->x[vertexO]; 
5 next_yO = hl(pcfz->y[vertexO]); 

} 

else { 

nextjcO = rightmost_x; 
next_yO = hl(pcfz->y[vertexO-l]); 
10 } 

if(vertexl < s->n) { 
nextjd = s->x[vertexl]; 
next_yl = h2(s->y [vertex 1]); 
} 

15 else { 

next_xl = rightmost^; 
next_y 1 = h2(s->y [vertex 1 - 1 ]); 

} 

while((vertexO < pcfz->n) || (vertexl < s->n)) { 
20 if(next_xO < next_x 1 ) 
use_func_0 = 1 ; 
else if(next_xl < next_xO) 
use_func_0 = 0; 
else 

25 use_fiinc_0 = (vertexO < pcfz->n); 
if(l = use_fiinc_0) { 

xr - next_xO; 

yOr = next_yO; 

if(next_xl — xl) 
30 frac - 0.0; 

else 
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frac = (xr-xl)/(next_xl-xl); 
ylr = yll + frac*(next_yl-yll); 
if(++vertexO < pcfz->n) { 

next_xO = pcfz->x[vertexO]; 

next_yO = hl(pcfz->y[vertexO]); 

} 

else 

next_xO = rightmostx; 

} 

else { 
xr = next_xl; 
ylr = next_yl; 
if(next_xO = xl) 
frac = 0.0; 
else 

frac = (xr-xl)/(next_xO - xl); 
yOr = yOl + frac*(next_y0-y01); 
if(++vertexl <s->n) { 

nextxl = s->x[vertexl]; 

next_yl = h2(s->y [vertex 1]); 

} 

else 

nextxl = rightmost_x; 

} 

if((xr>xl) && ((yOI-yll)*(yOr-ylr) < 0.0)) { 
intsec( xl, yOl, xr, yOr, yll, ylr, &xint, Ayint ); 
cj_out( xint, yint. xn, yn, &nn); 
} 

cj_out( xr, (cjdj A (y0r>ylr))? yOr: ylr, xn, yn, &nn ); 
xl = xr; 
yOl = yOr; 
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yll-ylr, 

} 

if(NULL !=pcfz->x) { 
#ifdefined(USE_WSC) 
5 FreeMemory( pcfz->x ); 
#else 

free( pcfz->x ); 
#endif 
pcfz->x = NULL; 
10 } 

#ifdcfined(USE_WSC) 
pcfz->x = NULL; 

(void)fNewMemory((void**)&pcfz->x,sizeof(double)*2*nn); 
#else 

1 5 pcfz->x = (double*)malloc(sizeof(double)*2*nn); 
#endif 
pcfz->y = pcfz->x + nn; 
pcfz->n = nn; 

memcpy( pcfz->x. xn, pcfz->n*sizeof(double) ); 
20 for(idx = 0; idx < nn; idx++) 

pcfz->y[idx] = pcfz->d_punhedge( yn[idx] ); 
if(NULL != xn) { 
#ifdefmed(USE_WSC) 
FreeMemory( xn ); 
25 #eise 

free( xn ); 
#endif 
xn = NULL; 
} 

30 } 

static int 
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fcentroid( struct CFuzzySet const* pcfz, double* pcentroid) { 
int rv = 0; 
*pcentroid - 0.0; 
if(pcfz->n>= 2) { 
5 double numer = 0.0, 
denom = 0.0; 
int idx; 

for(idx = 1 ; idx < pcfz->n; idx++) { 
double t, hyml,hy; 
10 hy = pcfz->d_phedge( pcfz->y[idx] ); 

hyml = pcfz->d_phedge( pcfz->y[idx-l] ); 

t = hyml *(2.0*pcfz->x[idx-l] + pcfz->x[idx]) + 

hy*(pcfz->x[idx-l] + 2.0*pcfz->x[idx]); 
numer += t*(pcfz->x[idx] - pcfz->x[idx-l]); 
15 denom += (pcfz->x[idx]-pcfz->x[idx-l ]) * (hyml + hy); 
} 

if(fabs(denom) > l.e-20) { 

*pcentroid = numer / (3.0*denom); 

rv=l; 
20 } 
} 

return rv; 

} 

static double return_y(double yval) { return yval; } 
25 static double square_y(double yval) { return yval*yval; } 
static double sqrt_y(double yval) { return sqrt(yval); } 
static double contint(double yval) { 
return (yval>=0.5)? sqrt(yval): (yval*yval); 
} 

30 static double contdeint(double yval) { 

return (yval>=sqrt(0.5))? (yval*yval); sqrt(yval); 
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} 

CFuzzySet* 

ConstructCFuzzySet(int npts, double const* xpts, double const* ypts, HEDGE ht) { 

CFuzzySet* pcfz = NULL; 
5 #ifdefined(USE_WSC) 

(void)fNewMemory((void**)&pcfz,sizeof(*pcfz)); 
#else 

pcfz = (CFuzzySet*)malloc(sizeof(*pcfz)); 
#endif 
10 if(NULL != pcfz) { 

if((pcfz->n - npts) < 1) { 
pcfz->n = 0; 



20 



15 



else { 

pcfz->fcentroid = fcentroid; 
pcfe->cj = cj; 
pcfz->invalid = invalid; 
pcfe->membership = membership; 
pcfz->compatlndex = compatlndex; 
pcfz->negate = negate; 
pcfz->print = print; 
pcfz->scale = scale; 
switch(pcfz->d_ht=ht) { 



default: 



25 



pcfz->d Jit = NOHEDGE; 



case NOHEDGE: 



pcfz->d_punhedge = return_y; 
pcfe->d_phedge = return_y; 
break; 



30 



case VERY: 



pcfz->d_phedge = square_y; 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



69 

pcfz->d_punhedge = sqrt_y; 

break; 
case SOMEWHAT: 

pcfz->d _phedge = sqrt_y; 

pcfz->d junhedge = squarely; 

break; 
case CONTINT: 

pcfz->d_phedge = contint; 

pcfz->d_punhedge = contdeint; 

break; 

} 

#ifdefmed(USEJWSC) 
pcfz->x = NULL; 

fNewMemory((void**)&pcfz->x,sizeof(double)*2*npts); 
#else 

pcfz->x = (double* )malloc(sizeof(double)*2*npts); 
#endif 

if(NULL==pcfz->x){ 
pcfz->n = 0; 

} 

else { 

pcfz->y = &pcfz->x[ pcfz->n ]; 

memcpy(pcfz->x^pts,npts*sizeof(double)); 

memcpy(pcfz->y,ypts,npts*sizeof(double)); 

} 

} 

} 

return pcfz; 
} 

void 

DestructCFuzzySet(struct CFuzzySet* pcfz) { 
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#ifdefined(USE_WSC) 
if(NULL != pcfz) { 
if(NULL!=pcfz->x){ 
FreeMemory( pcfz->x ); 
5 pcfz->x=NULL; 

} 

FreeMemory( pcfz ); 
} 

#else 

10 if(NULL!=pcfz) { 

if(NULL !=pcfz->x){ 
free( pcfz->x ); 
pcfz->x = NULL; 
} 

15 free(pcfz); 
} 

#endif 
} 

#ifdefined(SA) 
20 static void 

ftn_omitokn( double med_sp ) { 
static double t_SpX[J = {0.0, 0.25, 0.5, 0.75, 1.0}; 
static double nSpY[] = { 1.0, 0.50, 0.0, 0.50, 1.0}; 

# define SPSZ (sizeof(nS P Y)/sizeof(nSpY[0])) 
25 static double loSnX[] « {1.0, 1.4, 1.8); 

static double loSnY[] = {1.0, 0.5, 0.0}; 

# define LoSNSZ (sizeof(loSnX)/sizeof(loSnX[0])) 
static double loHtX[] = {0.03,0.05,0.07}; 

static double loHtYQ = { 1 .00,0.50,0.00} ; 
30 # define LoHTSZ (sizeof(loHtX)/sizeof(loHtX[0])) 
static double okxy[] = {0.0, 1.0}; 
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CFuzzySet *vNSP, *swNSP, *vLoSN, *swLoSN, *vLoHT, *swLoHT; 
double ht, Lsp,Rsp, sn, nSpX[ SPSZ ]; 
int idx; 

for(idx = 0; idx < SPSZ; idx++) 
5 nSpXpdx] = t_SpX[idx]*raed_sp; 

vNSP = ConstructCFuzzySet( SPSZ, nSpX, nSpY, VERY ); 
swNSP = ConstructCFuzzySet( SPSZ, nSpX, nSpY, SOMEWHAT ); 
vLoSN = ConstructCFuzzySet( LoSNSZ, loSnX, loSnY, VERY ), 
swLoSN = ConstructCFuzzySet( LoSNSZ, loSnX, loSnY, SOMEWHAT ), 
1 0 vLoHT = ConstructCFuzzySet( LoHTSZ, loHtX, loHtY, VERY ); 

swLoHT = ConstructCFuzzySet( LoHTSZ, loHtX, loHtY, SOMEWHAT ); 

# define HTINC 0.01 

# define SPINC 1.0 

# define SNINC 0.25 

15 for(ht = 0.02; ht <= IoHtX[LoHTSZ-l]; ht += HTINC) { 
for(Lsp = nSpX[0]; Lsp <= med_sp; Lsp += SPINC) { 
double modLsp = fmod(Lsp,med_sp); 
for(Rsp = nSpX[0]; Rsp <= med_sp; Rsp += SPINC) { 
double modRsp = fmod(Rsp,med_sp); 
20 for(sn = loSnX[0]; sn <= loSnX[LoSNSZ-l]; sn += SNINC) { 

CFuzzySet *RULE1, *RULElb, *RULE2, *RULE3, *RULE3b; 
double vloHT,swloHT,nswloHT, 
vnLSP,vnRSP,vnSP,nvnSP, 
swnLSP,swnRSP,swnSP,nswnSP, 
25 swloSN,nswloSN,vloSN, 
out[3]; 
double maxci = 0.0, ci[3]; 
intjdx, maxjdx = 0; 

RULE1 = ConstructCFuzzySet( 2, okxy, okxy,NOHEDGE ); 
30 RULE lb = ConstructCFuzzySet(2, okxy, okxy,NOHEDGE ), 

RULE2 = ConstructCFuzzySet( 2, okxy, okxy.NOHEDGE ); 
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RULE3 = ConstructCFuzzySet( 2, obey, okxy.NOHEDGE ), 
RULE3b = ConstructCFuzzySet(2, okxy, okxy,NOHEDGE ), 
vnLSP = vNSP->membership( vNSP, modLsp ); 
vnRSP = vNSP->membership( vNSP, modRsp ); 
5 vnSP = AND(vnLSP,vnRSP); 

swnLSP = swNSP->membership( swNSP, modLsp ); 
swnRSP = swNSP->membership( swNSP, modRsp ); 
swnSP = AND(swnLSP,swnRSP); 
vloSN = vLoSN->membership( vLoSN, sn ); 
10 swloSN = swLoSN->membership( swLoSN, sn ); 

vloHT = vLoHT->membership( vLoHT, ht ); 
swIoHT = swLoHT->membership( swLoHT, ht ); 

nvnSP = (1.0-vnSP); 
nswnSP = (1.0 - swnSP); 
1 5 nswloSN = (1.0- swloSN); 

nswloHT = (1 .0 - swloHT); 
RULE1 ->scale( RULE1 , AND( nswloSN, vnSP ) ); 
RULElb->scale( RULE lb, AND( nswloSN, nswloHT ) ); 
RULE1 ->cj(RULEl,RULEl b,DISJ); 
20 RULE2->scale( RULE2, AND(vloHT,nswnSP ) ); 

RULE3->scale( RULE3, AND(vloSN,vnSP) ); 
RULE3b->scale( RULE3b, AND(vloSN,nswloHT) ); 
RULE3->cj(RULE3,RULE3b,DISJ); 
ifi; 1 =RULE 1 ->fcentroid(RULEl ,&out[0])) 
25 out[0] - RULE 1 ->membership(RULE 1 ,out[0]); 

if( 1 =RULE3->fcentroid(RULE3,&out[ 1 ])) 
out[l] = RULE3->mcmbcrship(RULE3,out[l]); 
if( 1 =RULE2->fcentroid(RULE2,&out[2])) 
out[2] = RULE2->membership(RULE2,out[2]); 
30 printf("\n{ht,Lsp,Rsp,sn>:{%3.21f,%2.01f,%2.01f,%3.11f} -> ", 

ht,Lsp,Rsp,sn); 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT7US97/16933 



73 

printft" {OK,N,OMIT} : {%3.21f,%3.21f,%3.21f} ", out[0],out[ 1 ],out[2]); 
ci[0] = RULEl->compatIndex(RULEl); 
ci[l] = RULE2->compatIndex(RULE2); 
ci[2] = RULE3->compatIndex(RULE3); 
5 forO'dx = 0; jdx < 3; jdx++) 

if(maxci < cijjdx]) maxci = ci[maxjdx=jdx]; 
printf(" (CI=%3.21f)",maxci); 

#ifO 

if(maxci < 0.5) { 

10 printf("\n\tLow Compatability Index: CLASS #%d w MAX=%3.21f 

++maxjdx,maxci); 

printf("\n\tvnLSP=%3.21f vnRSP=%3.21f vnSP=%3.21f nvnSP=%3 .21f \ 
vnLSP,vnRSP,vnSP,nvnSP); 

printf("\n\tswnLSP=%3.2If swnRSP=%3.21f swnSP=%3 .21f nswnSP=%3.21f \ 
15 swnLSP,swnRSP,swnSP,nswnSP); 

printf("\n\tvloHT=%3.21f swloHT=%3.21f nswloHT=%3.21f 
vloHT,swloHT,nswloHT); 

printf("\n\tvloSN=%3.21f swloSN=%3.21f nswloSN=%3 .2lf 
vloSN,swloSN,nswloSN); 
20 printf("\n\tRULEl : tswloSN & vnSP"); 

printf("\n\tRULElb: IswIoSN & IswloHT"); 

printf]["\n\tRULE2 : vloHT & IswnSP"); 

printf("\n\tRULE3 : vloSN& vnSP"); 

printfl: M \n\tRULE3b: vloSN & IswloHT"); 
25 RULE 1 ->print(RULE 1 ."RULE 1"); 

RULE lb->print(RULE 1 b,"RULEl b"); 

RULE2->print(RULE2,"RULE2"); 

RULE3->print(RULE3,"RULE3"); 

RULE3b->print(RULE3b,"RULE3b"); 
30 vNSP^printfvNSP.'VNSP"); 

swNSP->print(swNSP, M swNSP"); 
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vLoSN^prin^vLoSN/'vLoSN"); 
swLoSN->print(swLoSN,"swLoSN"); 
vLoHT->print(vLoHT, ,, vLoHT"); 
swLoHT->print(swLoHT,"swLoHT"); 
5 } 
#endif 

DestructCFuzzySet(RULEl ); 
DcstructCFuzzySet(RULEl b); 
DestructCFuzzySet(RULE2); 
10 DestructCFuzzySet(RULE3); 

DestructCFuzzySet(RULE3b); 
} 

} 

} 

15 printf( n \n"); 
} 

DestructCFuzzySet(vNSP); 
DestructCFuzzySet(swNSP); 
DcstructCFuzzySet(vLoSN); 
20 DestructCFuzzySet(swLoSN); 
DestructCFuzzySet(vLoHT); 
DestructCFuzzySel(swLoHT); 

} 

void 

25 main(int argc, char* argv[]) { 
double normSP; 
if(2!=argc){ 
fprintf(stderr,"usage: %s normSpacing\n",argv[0]); 
exit(l); 
30 } 

sscanf(argv[l],"%lf\&normSP); 
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ftn_omitokn(normSP); 
exit(0); 

} 

#endif 

5 

* FILE: CorrCoefxxx 
♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
10 */ 

#include <nrc/nr.hxx> 
double 

corrcoef( float const* xp, float const* yp, int N ) 

{ 

1 5 struct S { double xBar, yBar, xy2, xx2, yy2; } s; 
int idx; 

if(N<2) return 0.0; 
s.xBar = s.yBar = 0.0; 
for(idx=0 ; idx < N; idx-H-) { 
20 s.xBar += double(xp[idx]); 
s.yBar += double(yp[idx]); 
} 

s.xBar /= double(N); 
s.yBar /=double(N); 
25 s.xy2 = s.xx2 = s.yy2 = 0.0; 
for(idx=0 ; idx < N; idx++) { 
double dx, dy; 

dx = (double(xp[idx]) - s.xBar); 
dy = (double(yp[idx]) - s.yBar); 
30 s.xy2 += dx*dy; 

s.xx2 += dx*dx; 
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s.yy2 += dy*dy; 

} 

s.xy2/=(double(N)-1.0); 
s.xx2/=(double(N)-1.0); 
5 s.yy2/=(double(N)-1.0); 
ifl[0.0=s.xx2 1| 0.0=s.yy2) 
return 0.0; 
else 

return s.xy2/::sqrt(s.xx2*s.yy2); 

10 } 

j********************************************* ************** *********** 

* FILE: fuzzyseth 

* Copyright © 1996, University of Utah 
15 */ 

#if !defined(_FUZZYSET_HJ 
#define _FUZZYSET_H_ 
#include <math.h> 
#include <stdio.h> 
20 #include <string.h> 
^include <malloc.h> 

#if defined( cplusplus) 

extern "C" { 
#endif 

25 typedef enum { DISJ, CONJ } LOGICAL_OPERATOR; 

typedef enum { NOHEDGE, SOMEWHAT, VERY, CONTTNT } HEDGE; 

#defme AND(vl,v2) ((vl<v2)?(vl):(v2)) 

#defme OR(vl,v2) ((vl>v2)?(vl):(v2)) 

#defmeNOT(v) (l-(v)) 
30 typedef struct CFuzzySet { 

int (*fcentroid)( struct CFuzzySet const* pcfz, double* pcentroid ); 
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void (*cj)( struct CFuzzySet* pcfz,struct CFuzzySet const* ps t LOGICAL_OPERATOR cjdj 

); 

int (*invalid)( struct CFuzzySet const* pcfz ); 

double (*membership)( struct CFuzzySet const* pcfz, double pt ); 

double (*compatIndex)( struct CFuzzySet const* pcfz ); 

void (*negate)( struct CFuzzySet* pcfz ); 

void (*print)( struct CFuzzySet const* pcfz, char const* pname ); 

void (*scale)( struct CFuzzySet* pcfe, double fac ); 

int n; 

double* x; 
double* y; 
HEDGE d_ht; 

double (*d j>hedge)(double yval); 
double (*d_punhedge)(double yval); 
} CFuzzySet; 
CFuzzySet* 

ConstructCFuzzySet(int npts,double const* xpts,double const* ypts,HEDGE ht); 
void DestructCFuzzySet(CFuzzySet* pcfz); 
#if defined( cplusplus) 

} 

#endif 
#endif 

****** *************************************** 

* FILE: gaussj.cxx 

* TYPIST: Andy Marks 

* Copyright © 1996 University of Utah 
*/ 

#include <math,h> 
#include <nrc/nrutil.hxx> 

tfdefine SWAP(a,b) { temp=(a); (a)=(b); (b)-temp; } 
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#if !defmed(SA) 
void 

gaussj(float** a, int n, float** b, int m) 

{ 

5 int *indxc, *indxr, *ipiv; 
int i, icol, irowjJc,el,ll; 
float pivinv, temp; 
indxc = ivector(l,n); 
indxr = ivector(l,n); 
10 ipiv = ivector(l,n); 

forO=l;j<=n;j-H-)ipiv[j] = 0; 
for(i=l;i<=n;i++) { 

float big = O.Of; 

for(j=l;j<=n;j++) 

15 if(l !=ipivD]) 

for(k=l;k<=n;k++) { 

if(0=ipiv[k]) { 
if(fabs(aD][k])>=big){ 
big = float(fabs(aD][k])); 
20 irow=j; 

icol = k; 
} 

} 

else if(ipiv[k] > 1) 
25 nrerror( "gauusj: Singular Matrix- 1 " ); 

} 

++(ipiv[icol]); 
if(irow != icol) { 
for(el=l ;el<=n;el++) SWAP(a[irow][el],a[icol][el]); 
30 for(el=l ;el<=m;el-H-) SWAP(b[irow][el],b[icol][el]); 

} 
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indxr[i]=irow; 
indxc[i]=icol; 
if(0.0f=a[icol][icol]) 
nrerror("gaussj: Singular Matrix-2"); 
5 pivinv= 1.0f/a[icol][icol]; 
a[icol][icol]=1.0f; 

for(el=l;el<=n:el++) a[icol][el] *= pivinv; 
for(el=l;el<=m;el++) b[icol][el] *= pivinv; 
for(ll=l;ll<=n; 11++) 
10 if(ll != icol) { 

float dum = a[ll][icol]; 

a[ll][icol]=0.0f: 

for(el=l;el<=n:el++) a[ll][el] -= a[icol][el]*dum; 
for(el=l;el<=m:el++) b[ll][el] -= b[icol][el]*dum; 
15 } 
} 

for(el = n; el>=l;el--){ 
ifflndxr[el] != indxc[el]) 
for(k=l;k<=n;k++) 
20 SWAP( a[k][indxr[el]], a[k][indxc[el]] ); 

} 

free^ivectoi^ipiv, 1 ,n); 
free_i vector(indxr, 1 ,n); 
free_ivector(indxc, 1 ,n); 

25 } 

tfendif 

#ifdefined(SA) 
#include <stdio.h> 
^include <nrc/nr.hxx> 
30 int 

main(int argc, char* argv[]) 
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{ 

float **a, **b, **aorig, **borig, **aprod, **bprod; 
int r, c, s; 

a = matrix(l,4,l,4); 
5 aorig = matrix( 1 ,4, 1 ,4); 
aprod = matrix(l,4,l,4); 
b = matrix(l,4,1.2); 
borig = matrix( 1,4,1,2); 
bprod = matrix( 1 ,4, 1 ,2); 
10 a[l][l] = 7.0f; a[l][2] = 8.0f; a[l][3] = 9.0f; a[l][4] = lO.Of; 
a[2][l] = 6.0f; a[2][2] = l.Of; a[21[3] = 2.0f; a[2][4] = 1 1 -Of; 
a[3][l]= 5.0f; a[3][2] = 4.0f; a[3][3] = 3.0f; a[3][4] = 12.0f; 
a[4][l] = 16.0f; a[4][2] = 15.0f; a[4][3] = 14.0f; a[4][4] = 13.0f; 
for(r=l;r<=4; r++) 
15 for(c=l;c<=4;c++) 
aorig[r][c] = a[r][c]; 
b[l][l]= 50.0f;b[l][2] = 40.0f; 
b[2][1] = 122.0f; b[2][2] = 96.0f; 
b[3][l] = 194.0f; b[3][2] = 152.0f; 
20 b[4][l ] = 266.0f; b[4][2J = 2O8.0f; 
for(r=l;r<=4;r++) 
for(c=l;c<=2; C++) 
borig[r)[c] = b[rJ[c]; 
gaussj( a, 4, b, 2); 
25 printf( M inv(A):\n"); 
for(r=l;r<=4;r++) { 
for(c=l;c<=4;c++) 
printf("%9.4f" ? a[r][c]); 
printfC'Xn"); 
30 } 

printf("S:\n M ); 
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for(r=l:r<=4;r++){ 

printf("\t"); 
for(c=l;c<=2;c-H-) 

printf("%9.4f^lr][c]); 

printfC'Nn"); 

} 

for(r=l;r<=4;r++) 
for(c=l;c<=4;c++) { 
float sum = O.Of; 
for(s=l;s<=4;s++) 
sum += a[r][s]*aorig[s][c]; 
aprod[r][c] = sum; 
} 

printf("inv(A)*A=I?\n"); 
for(r=l:r<=4;r++) { 

printf("\t"); 

for(c=l;c<=4;c++) 
printf(["%9.4f",aprod[r][c]); 

printf("\n"); 

} 

for(r=l:r<=4;rH-) 
for(c=l;c<=2;c++) { 
float sum = O.Of; 
for(s=l;s<=4;s++) 
sum += aorig[r][s]*Ms][c]; 
bprod[r][c] = sum; 
} 

printf("A*S=B?\n"); 
for(r=l;r<=4;r-H-) { 

printf("\t"); 

for(c=l;c<=2;c++) 
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printf("%9.4f B ,bprod[r][c]); 
printf("\n"); 
} 

for(r=l;r<=4;r++) 
5 for(c=l;c<=2;c++) { 
float sum = O.Of; 
for(s=l;s<=4;s-H-) 
sum += a[r][s]*borig[s][c]; 
bprod[r][c] = sum; 
10 } 

printf("inv(A)*B=S?\n"); 
for(r=l;r<=4;r++) { 

printfi("\t"); 

for(c=l;c<=2;c++) 
1 5 printf("%9.4f ^bprodWfc]); 

printf( M \n"); 

} 

free_matrix(a, 1 ,4,1 ,4); 
free_matrix(aorig, 1 ,4, 1 ,4); 
20 free_rnatrix(aprod.l.4,l,4); 
free_matrix(b, 1 ,4, 1 ,2); 
free_matrix(borig,l .4, 1 ,2); 
free_matrix(bprod. 1 ,4, 1 ,2); 
return 0; 

25 } 

#endif 

* FILE: mb.cxx - stands for "MyBusiness" 
30 * AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
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*/ 

#include <basecall/mb.hxx> 
#undefAS_WAS 
static int const BGNLFTR=13, 
5 ENDLFTR=23; 

MB::MB( int fluor ) : minFreq_(l .0), minFreqLane(O), lastFbwJO) 

{ 

bdStatics( fluor ); 
if(fluor) { 

10 static double const CROSSOVER = 0.23; 
static int const MAXFBW = 100; 
double K = 2.0*::sqrt(::log(CROSSOVER) / -0.5); 
for(int pdx=SMLGAP;pdx<=BIGGAP;pdx++) { 
fbwlutjpdx] = int( 0.5 + (K/double(pdx))*double(NPTS)/(2.0*NR_PI) ); 
1 5 if(fbwlut_[pdx] > MAXFBW) 

fbwlutjpdx] = MAXFBW; 

} 

} 

else { 

20 static double const CROSSOVER = 0.20; 
static int const MAXFBW = 1 00; 
double K = 2.0+::sqrt(::log(CROSSOVER) / -0.5); 
for(int pdx=SMLGAP;pdx<=BIGGAP;pdx++) { 
fbwlut_[pdx] = int( 0.5 + (K/double(pdx))*double(NPTS)/(2.0*NR_PI) ); 
25 if(fbwlut_[pdx] > MAXFBW) 

fbwlut_[pdx] = MAXFBW; 

} 

} 

} 

30 MB::~MB() 
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} 

MB::MB( MB const& rhs ) : minFreqJl.O), minFreqLaneJO), lastFbwJO) 
{ 

♦this = rhs; 

5 } 

MB const& 

MB::operator=(MB const& rhs) 
{ 

int idx, jdx; 
1 0 lastFbw_ = rhs.lastFbwj 

for(idx=jdx= 1 ;idx<=NPTS;idx+=l jdx+=2) { 
ivec Jjdx] = rhs.ivecjjdx]; ivecjjdx+l ] - rhs.ivecjjdx+l ]; 
imagjjdx] = rhs.imagjjdx]; imagjjdx+l ] = rhs.imagjjdx+l ]; 
lifter_[idx] = rhs.lifter_[idx]; 
1 5 filter Jidx] = rhs.filterjidx]; 

ww_[idx] = rhs.ww_[idx]; 

} 

minFreq_ = rhs.minFreq_; 
minFreqLane_ = rhs.minFreqLane_; 
20 for(idx=0;idx<5;idx++) 

bandAmplJidx] = rhs.bandAmplJidx); 
for(idx=SMLGAP;idx<=BIGGAP;idx++) 

fbwlut_[idx] = rhs.fbwlut_[idx]; 
return *this; 

25 } 
void 

MB::bdStatics( int fluor ) 
{ 

double bgnpt, endpU m, b; 
30 int sdx; 

lastFbw_ = 0; 
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lifterjl] = lifter_[NPTS] = 0.0; 

lifter_[NPTS/2+l] = 1.0; 

if(fluor) { bgnpt = 7; endpt = 24; } 

else { bgnpt = BGNLFTR; endpt = ENDLFTR; } 

m = NR_PI/(endpt-bgnpt); 

b = NR_PI/2.0 - m*endpt; 

for(sdx=2;sdx<=NPTS/2;sdx++) { 

if(sdx < bgnpt) 
lifter_[sdx]=0.0; 

else if(sdx >= bgnpt && sdx <= endpt) 
lifterjsdx] = 0.5 * (1.0 + ::sin( m*double(sdx) + b )); 

else 

lifter_[sdx] = 1.0; 

lifter_[NPTS-sdx+2] = lifter_[sdx]; 
} 

for(intj=l;j<=NPTS;j++) { 

ww_[j] = 2.0*NR_PI/double(NPTS) * (doubleQ) - doubie(NPTS/2)); 
wwJJ] *= ww_(j]; 

} 

} 

int 

MB::remune() const 
{ 

#ifl 
return 1; 
#else 

static int const OK YEAR =96; 
time_t now = time((time_t*)NULL); 
struct tm* ptm = localtime( &now ); 
if(OK_YEAR != ptm->tm_year) 
return 0; 
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else if(ptm->tm_mon>=3 && ptm->tm_mon<= 1 1) 

return 1; 
else 

return 0; 
5 #endif 
} 

*****+******* ****** ***************** 

# FILE: mb.hxx 

10 * AUTHOR: Andy Marks 

# COPYRIGHT (c) 1996, University of Utah 
*/ 

#ifndef_MB_HXX_ 
#define_MB_HXX_ 
1 5 #include <basecall/Pkdet.hxx> 
#include <basecall/seqrdr.hxx> 
#include <basecall/SegRead.hxx> 
#ifdefined(sun)||definedLWIN32) 

# define nint(v) int(((v)>0)?((v)+0.5):((v)-0.5)) 
20 tfendif 

#ifdefined(_WIN32) 
double drand48(); 

# define isnan _isnan 

# define finite _finite 
25 #endif 

static const int SMLGAP = 1 , 
BIGGAP = 33; 

class MB 
{ 

30 public: 

enum Status { STSJJNINITD, STSJNITD, STS_NO_MEM }; 
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MB( int fluor ); 
MB( MB const& rhs ); 
MB const& operator=(MB const& rhs); 
~MB(); 
5 int remune() const; 
double minFreq_, 

bandAmpl_[5]; 
int minFreqLane_; 
double ivecjl+2*NPTS), 
10 iraag_[l+2*NPTS], 
filter_[l+NPTS], 
Iifter_[l+NPTS], 
ww_[l+NPTS]; 
int lastFbw_; 
15 int fbwlutJl+BIGGAP]; 
private: 
void bdStatics( int fluor ); 
Status status_; 

>; 

20 #endif 

* FILE: Metrics.hxx 
♦AUTHOR: Andy Marks 
25 * COPYRIGHT (c) 1 996, University of Utah 
*/ 

#ifndef _METRICS_HXX_ 
tfdefine _METRICS_HXX_ 
#ifdefined(WIN32) 
30 class _declspec( dllexport) BandStat 
#else 
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#endif 

{ 

public: 
5 BandStat(); 

-BandStatO {}; 

void ntnr( int v ) { ntnr_ = v; } 
void posn( int v ) { posn_ = v; } 
void hght( float v ) { hght_ = v; } 

1 0 void lowv( float v ) { lowv_ = v; } 
void xbnd( float v ) { xbnd_ = v; } 
void shap( float v ) { shap_ = v; } 
void widt( float v ) { widt_ = v; } 
void lgap( float v ) { lgap_ = v; } 

1 5 void sgap( float v ) { sgap_ - v; } 
void buzz( float v ) { buzz_ = v; } 
void bbgn( int v) { bbgn_ = v; } 
void bend( int v) { bend_ = v; } 
void insr( int v ) { insr_ = v; } 

20 void cali( char v ) { call_ = v; } 
void qual( float v ) { qual_ = v; } 
int ntnr() const { return ntnr_; } 
int posn() const { return posn_; } 
float hght() const { return hght_; } 

25 float lowv() const { return lowv_; } 
float xbnd() const { return xbnd_; } 
float shap() const { return shap_; } 
float widt() const { return widt_; } 
float lgap() const { return lgap_; } 

30 float sgap() const { return sgap_; } 
float buzz() const { return buzz_; } 
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int bbgn() const { return bbgn_; } 

int bend() const { return bend_; } 

int insr() const { return insr_; } 

int awid() const { return bend_-bbgn_+l ; } 

float qual() const { return qual_; } 

float StadenQual() const { return 98.0f*qual_+l. Of; } 

char call() const { return call_; } 

void debug() const; 
private: 
int ntnr_; 
int posn_; 
int bbgn_; 
int bend_; 
int insr; 
float hght_; 
float lowv_; 
float xbnd_; 
float shap_; 
float buzz_; 
float widt_; 
float lgap_; 
float sgap_; 
char call_; 
float qual_; 

}; 

#ifdefined(WIN32) 

class _declspec( dllexport) BandStatArray 
#else 

class BandStatArray 
#endif 

{ 
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public: 
BandStatArray(); 

BandStatArray( BandStatArray const& rhs ); 
BandStatArray const& operator^ BandStatArray const& rhs ); 
5 -BandStatArrayO; 
void init( int len ); 

void ntnr( int idx, int v ) { bandj idx ] .ntnr(v); } 
void posn( int idx, int v ) { band_[ idx ].posn(v); } 
void bbgn( int idx, int v ) { band_[ idx ].bbgn(v); } 

1 0 void bend( int idx, int v ) { bandj idx ].bend(v); } 
void insr( int idx, int v ) { band J idx ].insr(v); } 
void hght( int idx, float v ) { bandj idx ].hght(v); } 
void lowv( int idx, float v ) { bandj idx ] Jowv(v); } 
void xbnd( int idx, float v ) { bandj idx ].xbnd(v); } 

15 void shap( int idx, float v ) { bandj idx ].shap(v); } 
void buzz( int idx, float v ) { bandj idx ].buzz(v); } 
void widt( int idx, float v ) { band_[ idx ].widt(v); } 
void lgap( int idx, float v ) { bandj idx ].lgap(v); } 
void sgap( int idx, float v ) { bandj idx ].sgap(v); } 

20 void qual( int idx, float v ) { bandj idx ].qual(v); } 
void call( int idx, char v ) { bandj idx ].call(v); } 
void append( BandStatArray const& rhs, int oseiend, int nselbgn ); 
int ntnr( int idx ) const { return bandjidx].ntnr(); } 
int posn( int idx ) const { return band Jidx].posn(); } 

25 int bbgn( int idx ) const { return bandjidx].bbgn(); } 
int bend( int idx ) const { return bandjidx].bend(); } 
int insr( int idx ) const { return band_[idx].insr(); } 
float hght( int idx ) const { return band Jidx].hght(); } 
float lowv( int idx ) const { return band Jidx].lowv(); } 

30 float xbnd( int idx ) const { return band Jidx].xbnd(); } 
float shap( int idx ) const { return bandjidx].shap(); } 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



91 

float buzz( int idx ) const { return band_[idx].buzz(); } 
float widt( int idx ) const { return band_[idx].widt(); } 
float lgap( int idx ) const { return band_[idxj.lgap(); } 
float sgap( int idx ) const { return band _[idx].sgap(); } 
5 float qual( int idx ) const { return band_[idx].qual(); } 

float StadenQual(int idx) const { return band_[idx].StadenQual(); } 
char call( int idx ) const { return band_[idx].call(); } 
int len() const { return len_; } 
BandStat& band( int idx ) { return band_[idx]; } 
10 BandStat const& band( int idx ) const { return band_[idx]; } 
void debugQ const: 
private: 
int len_; 

BandStat* band_; 

15 }; 

#endif 

/******* ********************* 4^******^ 

* FILE: nfeeder.cxx 
20 * AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
»/ 

#include <basecall/mb.hxx> 
#include <nrc/Centroid.hxx> 
25 static const int 

MAXPASSES = 6, 
INPUTSTEP = 1900, 
ENDPT = (INPUTSTEP+NPTS)/2, 
FROMEND =(NPTS-INPUTSTEP)/10; 
30 static const int PERCENTILE = 40; 
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static int 

i_cmp( void const* el, void const* e2 ) 
{ 

return (*(int*)el) - (*(int*)e2); 

5 } 
int 

SegRead::fBandSpace( int& spacing ) const 
{ 

BandStatArray const& m = bandStats(); 
10 int len. idxO,idxl; 
int* diff; 
if(m.len()<2) 
return 0; 
len = int( m,len()-l ); 
15 if(NULL = (diff = ::ivector( 1, len ))) 
return 0; 

for(idxO=0,idxl=l; idx0<len; idx0++ j idxl++) 
diffjidxl] = m.posn(idxl) - m.posn(idxO); 
::qsort( &diffjl], len, sizeof(*diff), i_cmp ); 
20 spacing = diffI(PERCENTILE*len)/100]; 
::free_ivector( diff, 1, len ); 
return 1; 

} 

int 

25 Wvfm: :nfeeder( RdrOut& output, int fVb ) 
{ 

int PASSES, rawpts, fbw=82, spacing; 

int rv = 0; 

srand(O); 

30 output.qualctrl().startTimer(); 
if(l != preproc()) { 

SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



93 

if(fVb) ::fprintf(stderr,"preproc() failed\n"); 

return 0; 

} 

output.iSl(bgni() ); 
rawpts = endi()-bgni()+l ; 
if(rawpts<NPTS) 
PASSES = 1; 
else { 

PASSES = int(ceil(float(rawpts-NPTS)/float(INPUTSTEP)) + 1); 
if(PASSES > MAXPASSES) 
PASSES = MAXPASSES; 

} 

SegRead segrd( bgni(), FLUOR=ds() ); 
for(int passNr=l; passNr <= PASSES; passNr++) { 
if(0 = (rv = nreader( fbw, 0, segrd ))) { 

if(fVb) ::fprintf(stderr,"nreader(0) failed,passNr=%d\n",passNr); 
break; 

} 

else if(0 = (rv = segrd.fBandSpace( spacing ))) { 
if(fVb) : :fprintf(stderr,"fBandSpace(%d) failed,passNr=%d\n",passNr); 
break; 
} 

else { 

if(l !=passNr) { 

int ospace = output.qualctrl().bspac(passNr-2); 
if(spacing < ospace) 
spacing = ospace; 

} 

segrd.fbwlut( spacing, fbw ); 
if(0 = (rv = nreader( fbw, 1 , segrd ))) { 
if(fVb) ::fprintf(stderr."nreader(l ) failed,passNr=%d\n",passNr); 
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break; 
} 

else if(0 = (rv = output.add( passNr, fbw, spacing, segrd ))) { 
if(fVb) ::fprintf(stderr,"output.add()failed,passNr=%d\n",passNr); 
5 rv= 1; 

break; 
} 

} 

int newStart = segrd.iSl()+INPUTSTEP; 
1 0 if((newStart+NPTS) >= endi()) 
newStart = endi()-NPTS; 
segrd.iSl( newStart ); 

} 

output.qualctrl().stopTimer(); 
15 return rv; 
} 

********************************************** 

* FILE: nr.hxx 

20 * TYPIST: Andy Marks 

* Copyright © 1996 University of Utah 
*/ 

#ifndef_NR_HXX_ 
#define_NRHXX_ 
25 include <math.h> 

^include <nrc/nrutil.hxx> 

#if defined( cplusplus) 

static double const NR_PI = acos(-l.O); 
#else 

30 # define NR_PI acos(-l .0) 
#endif 
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enum PFWGHT { INSTRUMENTAL-!, NO_WEIGHTING, STATISTICAL }; 
DLLexport int polfit(float const* x, float const* y, float const* sigmaY, 
int NPTS, int NTERMS, PFWGHT wm, 
float coef[], float& chisq); 
5 DLLexport int ilinreg(int const* x, int const* y, int n, float coef[3], float* rr ); 
DLLexport int linreg(float const* x, float const* y, int n, float coef[3], float* rr ); 
DLLexport double corrcoef(float const* vl, float const* v2, int N); 
DLLexport int iquadratic(int const* x, int const* y, int n, float coefpJ]); 
DLLexport int dquadratic(double const* x, double const* y, int n, float coef[4]); 
10 DLLexport void gaussj(float** a, int n, float** b, int m); 

DLLexport void spline(float const x[], float const y[], int n, float ypl, float ypn ? float y2[]); 
DLLexport void splint(float const xa[],float const ya[],float const ya2[],int n, 
float x, float *y); 

DLLexport void dfourl (double data[], unsigned long nn, int isign); 
15 #endif 

* FILE: nreader.cxx 
♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
20 */ 

#include <basecall/mb.hxx> 
#ifdefined(_ WIN32) 
double 
drand48() 
25 { 

return double(randO) / double(RAND_MAX); 

} 

#endif 
void 

30 randPadd( double** rm, int rows, int cols ) 
{ 
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for(int r=l; r<=rows; t++) 
for(int c=l; c<=cols; C++) 
rm[r][c] = drand48(); 

} 

5 double 

dmean( double* v, int b, int e ) 

{ 

double sum = 0.0; 
for(int idx=b; idx<=e; idx++) 
10 sum+=v[idx]; 

return sum/double(e-b+l); 

} 

int 

d_cmp( void const* el , void const* e2 ) 
15 { 

double vl = (*(doub!e const*)el), 

v2 = (*(double const*)e2); 
if(vl>v2) 
return 1; 
20 elseif(vl<v2) 
return -1; 
else 

return 0; 

} 

25 int 

Wvfm::nreader( int FBW, int pass2, SegRead& sr ) 
{ 

Wvfm bdproc( NPTS, 4, lnordr(), ds(), method() ); 
int have, endpt, iSN, scnl, lane, ptl = sr.iSl(); 
30 iSN = endi(); 

endpt = ptl+NPTS-1; 
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if(endpt <= iSN) { 
have = NPTS; 
for(lane=l ;lane<=4;lane++) 
for(scnl=pt 1 ;scnl<=endpt;scnl++) 
5 bdproc.sc_la_set(scnl-ptl+l Jane,sc_la(scnl,lane)); 

} 

else { 

double **padding, *v; 
int need = endpt - iSN, q5, q95; 
10 U undef PADMATRIX 

# define PADVECT 

# ifdefined(PADVECT) 
intk=l,*pl = &k, NPLN=1; 

# else 

15 int *pl = &lane, NPLN=4; 

# endif 

if(NULL == (padding = ::dmatrix(l,lO+need,l,NPLN))) { 
status, = STS_NO_MEM; 
return 0; 
20 } 

::randPadd( padding, 10+need, NPLN ); 
have = iSN-ptl+l; 
q5 = int(nint(float(have)/20.0f)); 
q95 = have-q5; 
25 v = : :dvector( 1 , have ); 

for(lane=l; lane<=4; lane++) { 
double ht; 

for(scnl=l; scnl<=have; scnl++) { 
v[scnl] = sc_la( ptl+scnl-1 . lane ); 
30 bdproc.sc_la_set(scnl,lane,v[scnl]); 
} 
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::qsort( &v[l], have, sizeof(double), dcmp ); 
ht = v[q95]-v[q5]; 
for(scnl=l; scnl<=10; scnl++) { 
double v = bdproc.sc_la(have-(10-scnl),lane), 
p = ht*padding[scnl][*pl], 
vm = 0.5*(1.0+cos(double(scnl)*NR_PI/10.0)), 
vp - 0.5*(1.0+cos(NR_PI+double(scnl)*NR_PI/10.0)); 
bdproc.sc_la_set(scnUane,vm*v + vp*p); 

} 

for(scnl=l; scnl<=need; scnU-h) 
bdproc.scja_set(have+scnljane,ht*padding[10+scnl][*pl]); 

} 

::free_dmatrix( padding, 1,10+need, 1,NPLN ); 
::free_dvector( v, l,have); 

} 

Wvfm source( bdproc ); 
sr.blindeconv( bdproc, FBW ); 
if(l != sr.xtranorm( bdproc, source, have, pass2 )) 
return 0; 

Wvfm aligned( NPTS+sr.nsv().maxshft(), 4, inordr(), ds(), method() ); 
aligned.ssm( ssm() ); 
for(lane= 1 ; lane<=4; lane++) { 

short sv = sr.nsv().s(lane); 

for(scnl=l; scnl<=NPTS; scnl++) 
aligned.sc_la_set( sv+scnl, lane, bdproc.sc la(scnLlane) ); 

} 

if(pass2) 

aligned.endi( have+srnsv().maxshft() ); 
sr.wvfm( aligned ); 
ShftVect noshft; 
sr.wvfm()->envelope( noshft ); 
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return sr.nrefine( lnordr(), FBW, have, pass2 ); 




5 



* FILE: 



nrefine.cxx 



♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 

*/ 

#include <basecall/mb.hxx> 
] 0 include <basecall/fgapcheck.h> 
int 

insMetric( int const* px, int const* py, int* ytmp, int N ) 

{ 

float coef[4], std; 
1 5 int idx, jdx, *xtmp = NULL; 
if(N<4) 
return 0; 

if(l != ::iquadratic( px,py,N, coef )) 
return 0; 

20 std = float( ::sqrt( coefl3] ) ); 
xtmp = ::ivector(l,N); 
if(NULL = xtmp) return 0; 
for(jdx=idx=l; idx<=N; idx++) { 
float x = float(px[idx]); 
25 ytmp[jdx] = int(coef[0)+x*coefll]+x*x*coef[2]); 
xtmp[jdx] = px[idx]; 
if(abs(ytmp[jdx]-py[idx]) < std) 
jdx-H-; 

} 

30 if(«jdx >= 4) { 

if(l \- ::iquadratic( xtmp,ytmp,jdx, coef)) { 
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: :free_ivector(xtmp, 1 ,N); 
return 0; 

} 

for(idx=l; idx<=N; idx++) { 
5 float x = float(px[idx]); 

ytmp[idx] = int(coefl0]+x*coefll]+x*x*coef[2]); 

} 

} 

::free_ivector(xtmp, 1 ,N); 
10 if(0.0f!=coefI2]) { 

int Ipt = (int)nint(-coefll]/(2.0*coef!2])); 
if(Ipt>=l && lpt<=N) { 
int K = ytmp[Ipt]; 
if(coef[2] > 0.0) 
1 5 for(idx= 1 ;idx<lpt;idx++) 

ytmp[idx] = K; 
else for(idx=(Ipt+l );idx<=N;idx++) 
ytmp[idx] = K; 

} 

20 } 

return 1 ; 

} 

int 

SegRead::centroid_( int bgn, int end ) const 

25 { 

Wvfm const& w = *wvfm(); 
double numer=0.0, denom=0.0; 
int idx=bgn+ 1 , jdx=idx- 1 , N = end-bgn+ 1 ; 
if(N < 2) 
30 return (bgn+N/2); 

else { 
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double minv = 100000.0; 
for(int kdx=bgn;kdx<=end;kdx++) 
if(w.enw(kdx)<minv) 
minv = w.enw(kdx); 
5 for( ;idx<=end; idx++jdx++) { 
double vj = w.enw(jdx)-minv; 
double vi = w.enw(idx)-minv; 
numer += vj*(2.0*jdx + idx) + vi*Qdx + 2.0*idx); 
denom += (vj + vi); 
10 } 
} 

return int(nint(numer/(3 .0 * denom))) ; 

} 

int 

1 5 SegRead::nrefme( char const* LNORDR, int FBW, int npts, int pass2 ) 
{ 

PKDET rawPks, 
refPks; 

float *xbnd, *ht, *lo, **om; 
20 int 'bandcode, *insSP, *insWD, NPK, idx, jdx, sts; 
if(l != peakdet( npts, rawPks )) 

return 0; 
NPK = rawPks.npk(); 
xbnd = xbndara_( rawPks ); 
25 if(NULL == xbnd) { 

status, = STS_NO_MEM; 
return 0; 

) 

if(NULL = (bandcode = ::ivector(l, nWvf_->scanl() ))) { 
30 status_=STS_NO_MEM; 

::free_vector( xbnd,l.NPK ); 
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return 0; 

} 

maxlanecode ( npts, bandcode ); 
if(NPK >= STATIC_BUF_SZ) { 
5 ::free_vector( xbnd,l ,NPK ); 

status, = STS_BUF2SMALL; 

return 0; 

} 

insSP = ::ivector( 1, NPK ); 
10 if(NULL — insSP) { 

status_ = STS_NO_MEM; 
::free_vector( xbnd,l,NPK ); 
return 0; 
} 

15 ifi(l != ::insMetric( rawPks.bmid(), rawPks.lgap(), insSP, NPK )) { 

status_ = STS_TOO_FEW; 

::free_vector( xbnd,l,NPK ); 

: : free_ivector(insSP, 1 ,NPK); 

return 0; 
20 } 

ht = ::vector( 1,NPK); 
if(NULL = ht) { 

status_ = STS_NO_MEM; 

::free_vector( xbnd,l,NPK ); 
25 : :free_ivector( insSP, 1 , NPK ); 

return 0; 

} 

for(idx = 1 ; idx <= NPK; idx-H-) 
ht[ idx ] = float(nWvf_->enw( rawPks.bmid(idx) )); 
30 lo=::vector( l.NPK); 
if(NULL = lo) { 
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status_ = STS_NO_MEM; 
::frec_vector( xbnd.l.NPK ); 
::free_ivector( insSP, 1, NPK ); 
::free_vector( ht, 1,NPK ); 
5 return 0; 

} 

for(idx = 1 ; idx <= NPK; idx++) { 
float onsv, ofsv; 

onsv = float(nWvf_->enw( rawPks.bbgn(idx) )); 
1 0 ofsv = float(nWvf_->enw( rawPks.bend(idx) )); 
lo[ idx ] = onsv<ofsv?onsv:ofsv; 

} 

om « ::matrix(l JsIPK.,1,2); 

sts = ::ornitokn(insSP,ht,lo,rawPks.lgap(),rawPks.rgap(),xbnd,NPK,om); 
15 ::free_vector( xbnd, l,NPK);xbnd = NULL; 
: :free_vector( ht, 1 ,NPK ); ht = NULL; 
::free_vcctor( lo, 1 ,NPK ); lo = NULL; 
::free_ivector( insSP, 1,NPK ); insSP = NULL; 
if(l !=sts) { 
20 ::free_matrix( om, 1, NPK, 1,2 ); 

status. = STS_TOO_FEW; 
return 0; 
} 

enum OKNOMIT { BAND_OK = 1, BAND_N = 2, BAND_OMIT = 3 }; 
25 for(jdx=idx= 1 ;idx<=NPK;idx++) 

switch(OKNOMIT(nint(om[idx][l]))) { 
case BAND_N: 
bandcode[ rawPks.bmid( idx ) ] = 5; 
case BANDOK: 
30 nband_[ jdx ] = rawPks.band( idx ); 

seq_[ jdx ] = LNORDR[ nWvf_->envi( rawPks.bmid( idx ) )-l ]; 
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jdx++; 
break; 

} 

::free_matrix( om, 1, NPK, 1,2 ); om = NULL; 
5 NPK=jdx-l; 

refPks.set( nband_, NPK ); 
insSP = ::ivector( 1,NPK); 
if(NULL = insSP) { 
status_ = STS_NO_MEM; 
10 return 0; 

} 

if(l != ::insMetric( refPks.bmid(), refPks.lgap(), insSP, NPK )) { 
status_ = STS_TOO_FEW; 
::free_ivector( insSP, 1, NPK ); 
15 return 0; 

} 

ins WD = ::ivector( 1, NPK ); 
iftNULL == insWD) { 

status_ = STS_NO_MEM; 
20 : :free_ivector( insSP, 1 ,NPK ); 

return 0; 

} 

if(l != ::insMetric( refPks.bmid(), refPks.bwid(), insWD, NPK )) { 
status_ = STS_TOO_FEW; 
25 ::free_ivector( insSP,l,NPK ); 

::free_ivector( ins WD, 1, NPK ); 

return 0; 

} 

om = ::matrix(l,NPK,l,2); 
30 sts = : :gapcheck(insSP, insWD, refPks.lgap(),refPks.bwid(),seq_, NPK,om); 
::free_ivector( ins WD, 1 ,NPK ); insWD = NULL; 
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if(l !=sts) { 
::freeJvector( insSP,l,NPK); 
::free_matrix(om, 1 ,NPK, 1 ,2); 
return 0; 

5 } 

nbandj 1 ] = refPks.band( 1 ); 
for(jdx=idx=2; idx<=NPK; idx++) { 
if(GAP_SPLIT = GPCHK(nint(ora[idx][l]))) { 
int m4, numSmallGap, newSP, mid; 
10 double ratio; 

m4 = idx- 1 ; 
if(0 = insSP[m4]) { 
status, = STS_TOO_FEW; 
::free_ivector( insSP,l,NPK ); 
1 5 : :free_matrix( om, 1 ,NPK, 1 ,2 ); 

return 0; 

} 

ratio - double(refPks.rgap(m4))/double(insSP[m4]); 
numSmallGap = int(0.5 + 0.2 + ratio); 
20 if(0 = numSmallGap) { 

status. = STS_TOO_FEW; 
::free_ivector( tnsSP,l,NPK ); 
::free_matrix( om,l 5 NPK,l,2 ); 
return 0; 
25 } 

newSP = refPks.rgap( m4 ) / numSmallGap; 
mid = refPks.bmid( m4 ); 
for(int ndx=l; ndx<numSmallGap; ndx-H-) { 
int bgn,end,c; 
30 mid+=newSP; 

bgn = mid-newSP/2; 
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end = bgn+newSP-l ; 

Band b( bgn, c=centroid_(bgn,end). end, 1 ); 
nband_[jdx-t-+ ] = b; 
ifOdx >= STATIC_BUF_SZ) { 
5 status, = STS_BUF2SMALL; 

::free_ivector( insSP,l,NPK); 
::free_matrix( om,l ,NPK,1,2 ); 
return 0; 
} 

10 } 
} 

nbandj jdx++ ] = refPks.band( idx ); 
if(jdx >= STATIC_BUF_SZ) { 

status_ = STS_BUF2SMALL; 
1 5 : :free_i vector( insSP, 1 ,NPK ); 

::free_matrix( om,l ,NPK, 1 ,2 ); 

return 0; 

} 

} 

20 ::free_ivector( insSP, 1 ,NPK ); insSP = NULL; 
: :free_matrix( om, 1 ,NPK, 1 ,2 ); om = NULL; 
NPK = jdx-l; 
refPks.set( nband_, NPK ); 
insSP=::ivector(l,NPK); 
25 if(NULL = insSP) { 

status_ = STS_NO_MEM; 

return 0; 

} 

if(l != ::insMetric( refPks.bmid(), refPks.lgap(), insSP. NPK )) { 
30 status_ = STS_TOO_FEW; 

::free_ivector( insSP, 1, NPK. ); 
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xbnd = xbndaraj refPks ); 
if(NULL = xbnd) { 
5 status_ = STS_NO_MEM ; 

::free_ivector( insSP, 1, NPK); 

return 0; 

} 

ht=::vector( 1,NPK); 
10 if(NULL==ht) { 

status, = STS_NO_MEM; 
::free_ivector( insSP,l,NPK ); 
: :free_vector( xbnd, 1 ,NPK ); 
return 0; 

15 } 

for(idx=l ;idx<=NPK;idx++) 
ht[ idx ] = float(nWvf_->envv( refPks.bmid(idx) )); 
lo=::vector( 1,NPK ); 
if(NULL == lo) { 
20 status_ = STS_NO_MEM ; 

::ftee_ivector( insSP,l,NPK ); 
::free_vector( xbnd, 1, NPK); 
::free_vector( ht,l,NPK); 
return 0; 
25 > 

for(idx = 1 ; idx <= NPK; idx++) { 
float onsv, ofsv; 

onsv » float(nWvf_->enw( refPks.bbgn(idx) )); 
ofsv = float(nWvf_->enw( refPks.bend(idx) )); 
30 lo[ idx ] = onsv<ofsv?onsv:ofsv; 
} 
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om = ::matrix(l,NPK,l,2); 

sts = ::omitokn(insSP,ht,lo,refPks.lg^)(),refPks.rgap(),xbnd,NPK,ora ); 
::free_vector( xbnd, l,NPK);xbnd = NULL; 
::free_vector( ht, l,NPK);ht = NULL; 
5 ::free_vector( lo, l,NPK);lo = NULL; 

::free_ivector( insSP, 1,NPK ); insSP = NULL; 
if(l !=sts) { 

: :free_matrix(om, 1 ,NPK, 1 ,2); 
return 0; 
10 } 

for(idx=jdx=l; idx<=NPK; idx++) { 
int bmid = refPks.bmid(idx); 
switch(OKNOMIT( nint(om[idx][l]) )) { 
case BAND_N: 
1 5 bandcode[ bmid ] = 5; 

case BANDOK: 
nband_[ jdx ] = refPks.band(idx); 
seq_[ jdx ] = LNORDR[ bandcode[ bmid ]-l ]; 
jdx++; 
20 break; 

default: 
break; 

} 

} 

25 ::free_matrix( om, 1 ,NPK.,1,2 ); om = NULL; 

::free_ivector( bandcode, 1 ,nWvf_->scanl() ); bandcode = NULL; 

NPK=jdx-l; 

refPks.set( nband_, NPK ); 

return setBandStats( refPks, FBW, seq_, pass2 ); 

30 } 
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/**+*+++*+******** ************* **************************************** 

* FILE: nrutil.cxx 

* TYPIST: Andy Marks 
*/ 

5 #include <stdio.h> 
#include <stddef.h> 
#include <stdlib.h> 
#include <nrc/nrutil.hxx> 
#defineNR_END 1 
1 0 #define FREE_ARG char* 
void 

nrerror( char const error_text[] ) 
{ 

fprintf(stderr,"Numerical Recipes run-time error.An"); 
1 5 fprintf(stderr,"%s\n", errorjext ); 

fprintf(stderr,"...now exiting to system.. An"); 
exit(l); 

} 

float* 

20 vector( long nl, long nh ) 
{ 

float *v; 

v = (float*)malloc((size_t)((nh-nl+H-NR_END)*sizeof(float))); 
if^NULL = v) nrerror("allocation failure in vector()"); 
25 return v-nl+NR_END; 
} 

int* 

ivector( long nl, long nh ) 

{ 

30 int *v; 

v = (int*)malloc((size_t)((nh-nl+l+NR_END)*sizeof(int))); 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



110 

if(NULL = v){ 
::fprintf(stderr,"ivector: nl=%ld nh=%ld v=%p\n",nl,nh,v); 
::nrerror("ivector: allocation failure"); 

} 

5 return v-nl+NR_END; 
} 

unsigned char* 

cvector( long nl, long nh ) 

{ 

10 unsigned char *v; 

v - (unsigned char*)malloc((sizeJ)((nh-nl+HNR_END)*sizeof(unsigned char))); 
if(NULL = v) nrerror("allocation failure in cvector()"); 
return v-nl+NR_END; 

} 

15 unsigned long* 

lvector( long nl, long nh ) 
{ 

unsigned long *v; 

v = (unsigned long*)malloc((size - t)((nh.nl+l+NR,END)*sizeof(unsigned long))); 
20 if(NULL ===== v) nrerror(" allocation failure in lvector() M ); 
return v-nl+NR_END; 

} 

double* 

dvector( long nl, long nh ) 
25 { 

double *v; 

v = (double* )malloc((size_t)((nh-nl+l+NR_END)*sizeof(double))); 
if(NULL — v) nrerror("allocation failure in dvector()"); 
return v-nl+NR_END; 

30 } 

float** 
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matrix(long nrl.long nrh,long ncl,long nch) 
{ 

long i, nrow = nrh-nrl+1, ncol=nch-ncl+l; 
float **m; 

5 m = (float**)malloc((sizeJ)((nrow+NR_END)*sizeof(float*))); 
if(NULL = m) nrerrorfallocation failure 1 in matrixO"); 
m+=NRJEND; 
m — nrl; 

mfnrll = (float*)malloc((sizeJ)((nrow*ncol+NR - END)*sizeof(float))); 
10 if(NULL = m[nrl]) nrerror("allocation failure 2 in matrix()"); 
m[nrl] += NREND; 
nifnrl] -= ncl; 

for(i=nrl+l;i<=nrh;i++) m[i] = m[i-l)+ncol; 
return m; 

15 } 

double** 

dmatrix(long nrl,long nrh,long ncl,long nch) 
{ 

long i, nrow = nrh-nrl+1 , ncol=nch-ncl+l ; 
20 double **m; 

m = (double**)mal)oc((size_t)((nrow+NR_END)*sizeof(double*))); 

if(NULL = m) { 
::fprintftstderr,"nrl=%ld, nrh=%ld, ncl=%ld, nch=%ld, nrow=%ld\n*\ 
nrl,nrh,ncl ,nch,nrow); 
25 nrerror("allocation failure 1 in dmatrixO"); 
} 

m += NRJEND; 
m -= nrl; 

m[nrl] = (double*)malloc((size_t)((nrow*ncol+NR_END)*sizeof(double))); 
30 if(NULL — m[nrl]) nrerror("allocation failure 2 in dmatrix()"); 
m[nrl]+=NR_END; 
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m[nrl] -= ncl; 

for(i=nrl+l ;i<=nrh:i++) m[i] = m[i-l)+ncol; 
return m; 

} 

5 int** 

imatrix(long nrljong nrh,long ncljong nch) 

{ 

long i, nrow = nrh-nrl+1, ncol=nch-ncl+l ; 
int **m; 

10 m = (int**)malloc((sizeJ)((nrow+NR_END)*sizeof(int*))); 
if(NULL = m) nrerror("allocation failure 1 in dmatrix()"); 
m += NREND; 
m — nrl; 

m[nrl] = (int+^alloc^sizeJJ^nrow^ncol+N^ENDJ^sizeofCint))); 
1 5 if(NULL = m[nrl]) nrerror( tt allocation failure 2 in imatrix() H ); 
m[nrl] += NR_END; 
m[nrl] — ncl; 

for(i=nrl+l;i<=nrh;i++) m[i] = m[i-l]+ncol; 
return m; 

20 } 

float** 

submatrix(float **a, long oldrl, long oldrh, long oldcl, long oldch, 
long newrl, long newel) 

{ 

25 long i j,nrow=oldrh-oldrH-l,ncol=oldch-oldcl+ 1 ; 
float **m; 

m = (float**)malloc((sizeJ)((nrow+NR_END)*sizeof(float*))); 
if(NULL — m) nrerror(" allocation failure in submatrix()"); 
m+= NREND; 
30 m — newrl; 

for(i = oldrl, j=newrl; i<=oldrh; i++o++) ~ a[i]+ncol; 
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return m; 

} 

float** 

convert_matrix(float* a, long nrl, long nrh, long ncl, long nch) 
5 { 

long i, j, nrow=nrh-nrl+l, ncol=nch-ncl+l ; 
float **m; 

m = (float**)malloc((size_t)((nrow+NR_END)*sizeof(float*))); 
if(NULL == m) nrerror( "allocation failure in convert_matrix()"); 
10 m+=NR_END; 
m — nrl; 
m[nrl] = a-ncl; 

for(i=l j=nrl+l ; i<=nrow; m[j] = m[j-l]+ncol; 

return m; 

15 } 
void 

free_vector(float* v ? long nl, long nh) 
{ 

free((FREE_ARG)(v+nl-NR_END)); 

20 } 
void 

free_ivector(int* v, long nl, long nh) 

{ 

free((FREE_ARG)(v+nl-NR_END)); 

25 } 
void 

free_cvector(unsigned char* v, long nl, long nh) 
{ 

free((FREE_ARG)(v+nl-NR_END)); 

30 } 

void 
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free_lvector(unsigned long* v, long nl, long nh) 

{ 

free((FREE_ARG)(v+nl-NR_END)); 

} 

5 void 

free_dvector( double* v, long nl, long nh) 

{ 

free((FREE_ARG)(v+nl-NR_END)); 

} 

10 void 

free_matrix(float **m, long nrl, long nrh, long ncl, long nch) 

{ 

free((FREE_ARG)(m[nrl]+ncl-NR_END)); 
free((FREE_ARG)(m+nrl-NR_END)); 

15 } 
void 

free_dmatrix(double **m, long nrl, long nrh, long ncl, long nch) 
{ 

free((FREE_ARG)(m[nrl]+ncl-NR_END)); 
20 free((FREE_ARG)(m+nrl-NR_END)); 
} 

void 

free_imatrix(int **m, long nrl, long nrh, long ncl, long nch) 
{ 

25 frce((FREE_ARG)(m[nrl]+ncl-NR_END)); 
free((FREE_ARG)(m+nrl-NR_END)); 

} 

void 

free_submatxix(int **b, long nrl, long nrh, long ncl. long nch) 
30 { 

free((FREE_ARG)(b+nrl-NR_END)); 
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} 

void 

free_convert_matrix(int *b, long nrl) 
{ 

5 free((FREE_ARG)(b+nrI-NREND)); 

} 

* FILE: Pkdet.cxx 
10 * AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

#include <basecall/mb.hxx> 
PKDET::PKDET() : 
1 5 statusJSTSJNITD), ppk_(NULL), ptr JNULL), gap_(NULL), 

wid_(NULL), ins_(NULL), NpJO), Nt JO), medGapJO), medWidJO) 

{ 
} 

PKDET::PKDET( PKDET const& rhs ) : 
20 statusJSTSJNITD), ppkJNULL), ptr (NULL), gap_(NULL), 

wid_(NULL), ins_(NULL), NpJO), Nt JO), medGapJO), medWid JO) 

{ 

*this = rhs; 

} 

25 PKDET const& 

PKDET::operator=(PKDET const& rhs) 
{ 

if(&rhs != this) { 
release(); 
30 Np_ = rhs.Np_; 
Nt_ = rhs.Nt_; 
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medGap_ = rhs.medGap_; 
medWid_ = rhs.medWid_; 
ppk_ = ::ivector( 1, Np_ ); 
wid_ = ::ivector( 1, Np_ ); 
5 ins_ = ::ivector( 1, Np_ ); 
ptr_ == ::ivector( l,Nt_); 
gap_= ::ivector( l,Nt_); 
tf(!ppk_ || !ptr_ || !gap_ || !wid_ || !insj { 
status_ = STS_N0JVIEM; 
10 release(); 

} 

else { 

for(int idx = 1 ; idx<=Np_; idx++) { 
ppk_[ idx ] = rhs.ppk_[ idx ]; 
1 5 wid_[ idx ] = rhs.widj idx ]; 

ptr_[ idx ] = rhs.ptrj idx ]; 
gap_[ idx ] = rhs.gapj idx ]; 
insj idx ] = rhs.insj idx ]; 
} 

20 ptr_[ idx ] = rhs.ptr_[ idx ]; 

gap_[ idx ] = rhs.gapj idx ]; 
} 

} 

return *this; 

25 } 
void 

PKDET::release() 
{ 

if(NULL != ppkj { ::free_ivector( ppk_, 1, Np_ ); ppk_ = NULL; } 
30 if(NULL != widj { ::free ivector( wid_, 1, Np_ ); wid_ = NULL; } 
if(NULL != insj { ::free_ivector( ins_, 1, Np_ ); ins_ = NULL; } 
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if^NULL != ptrj { ::free_ivector( ptr_, I, Nt_ ); ptr_ = NULL; } 
if(NULL != gap J { ::free_ivector( gap_, 1, Nt_ ); gap_ = NULL; } 
Np_ = Nt_ = 0; 

} 

5 PKDET: :~PKDET() 
{ 

release(); 

} 

static int 

10 i_cmp(void const* el , void const* e2) 
{ 

return (*(int const*)el)-(*(int const*)e2); 

} 

int 

1 5 PKDET: :set( int const* Pp, int Np, int const* Pt, int Nt ) 
{ 

int P1,T1, pxl.pxn, txl,txn, *gtmp, *wtmp; 
release(); 

pxl = Pp[ PW1 ]; pxn = Pp[ Np ]; 
20 txl = Pt[ Tl=l ]; txn = Pt[ Nt ]; 
if(pxl <txl)Pl++; 
if(pxn > txn) Np--; 
Np_ = Np-Pl+l; 
Nt_ = Nt; 

25 if((Nt_ < 2) || (Nt_ != (Np_+ 1 ))) { 

::fprintf(stderr,"PKDET::set, Np=%d, Nt=%d: Nt should — Np+l\n",Np_,NtJ ; 

::fprintf(stderr,"\tpxl=%d txl=%d pxn=%d txn=%d\n",pxl,txl,pxn,txn); 

Nt_ = Np_ = 0; 

return 0; 
30 } 

ppk_ = ::ivector( 1. Np_ ); 
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wid_ = ::ivector( 1, Np_ ); 
ins_ = ::ivector( 1, Np_ ); 
wtmp = ::ivector( 1 , Np_ ); 
gtmp = ::ivector( 1, Np_-1 ); 
5 ptr_ = : :i vector( 1 , Nt_ ); 
gap_ = ::ivector( 1, Nt_); 

if(!ppk_ || !ptr_ || !gap_ || !wid_ || !ins_ || !gtmp || Iwtmp) { 
status, = STS_NO_MEM; 
Nt_ = Np_ = 0; 
10 return 0; 
} 

for(int idx=l; idx<=Np_; idx++) { 
ppk_[idx] = Pp[Pl++]; 
ptr_[idx] = Pt[T1++]; 
15 wid_[ idx ] = Pt[ Tl ]-Pt[Tl-l ]; 
ins_[ idx ] = 0; 
wtmp[ idx ] = wid_[ idx ]; 
if(idx<NpJ{ 

gap_[ 1+idx ] = Pp[ PI ] - Pp[ Pl-1 ]; 
20 gtmp[ idx ] = gap_[ 1+idx ]; 

} 

} 

ptrjidx] = Pt[Tl]: 

::qsort( &gtmp[l], Np_-1 , sizeofi(*gtmp), i_cmp ); 
25 if((Np_-l)&l) { 

intraid= l+(Np_-l)/2; 
mcdGap_ = gtmp[mid]; 

} 

else { 

30 intmid = (Np_-l)/2; 

medGap_ = (gtmp[mid] + gtmp[mid+l]V2; 

SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



119 

} 

gapjl ) = gap_[Nt J = medGap_; 

::free_ivector( gtmp, l,Np_-l ); 

::qsort( &wtmp[l], Np_, sizeof(*wtmp), i.cmp ); 

medWid_ = (Np_&l)? wtmp[l+Np_/2): (wtmp[Np_/2]+wtmp[l+Np_/2])/2; 
::free_ivector( wtmp, l,Np_); 
return 1; 

} 

int 

PKDET::set( Band* Pb, int Nb ) 

{ 

int idx, *gtmp, *wtmp; 
release(); 
Np_ = Nb; 
Nt_ = Nb+l; 
if(Nt_ < 2) { 

"fprintf(stderr,*TKDET::set, Np=%d, Nt=%d: Nt!=Np+l\n*\Np_,Nt_); 
Nt_ = Np_ = 0; 
return 0; 

} 

ppk_ = ::ivector( 1 , Np_ ); 
wid_ = ::ivector( 1 , Np_ ); 
ins_ = ::ivector( 1, Np_ ); 
ptr_ = ::ivector( 1 , Nt_ ); 
gap_ = ::ivector( 1, Nt_ ); 
wtmp = ::ivector( 1, Np_ ); 
gtmp = ::ivector( 1, Np_-l ); 

ifl'.ppk_ II !ptr_ || !gap_ II !wid_ || !ins_ || !gtmp || !wtmp) { 
status_ = STS_NO_MEM; 
Nt_ = Np_ = 0; 
return 0; 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



120 

} 

for(idx=l; idx<Nb; idx++) { 
Band bn = Pb[idx], bnp 1 = Pb[idx+ 1 ]; 
if(bn.end() !=bnpl.bgn()) 
5 if(bn.wid() > bnpl .wid()) 

Pb[idx].end( bnpl.bgn()); 
else 

Pb[idx+l].bgn(bn.end()); 

ppkjidx] = Pb[idx].mid(); 
10 ptr_[idx] =Pb[idx].bgn(); 

widjidx] = Pb[idx].wid(); 

insjidx] = Pb[idx].ins(); 

wtmp[idx] = widjidx]; 

gapjl+idx] = Pb[idx+l].mid() - Pb[idx].mid(); 
15 gtmp[idx] = gapjl+idx]; 

} 

ppk_[ idx ] = Pb[ idx ].mid(); 

ptrjidx] =Pb[idx].bgn(); 

ptr_[ idx+1 ] = Pb[ idx ].end(); 
20 widjidx] =ptr Jidx+1] -ptrjidx]; 

ins_[ idx ] = Pb[ idx ].ins(); 

wtmp[ idx ] = widj idx ]; 

::qsort( &gtmp[l], Np_-1, sizeof(*gtmp), i_cmp ); 

if((Np_-l)&l){ 
25 intmid= l+(Np_-l)/2; 

medGap_ = gtmp[mid]; 
} 

else { 
intmid = (Np_-l)/2; 
30 medGap_ = (gtmp[mid]+gtmp[mid+l ])/2; 
} 
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gap_D] = gap_[NtJ = medGap_; 

::free_ivector( gtmp, l,Np_-l ); 

::qsort( &wtmp[l], Np_, sizeof(*wtmp), i cmp ); 

medWid_ = (Np_&l)? wtmp[l+Np_/2]: (wtmp[Np_/2]+wtmp[l+Np_/2])/2; 
5 ::free_ivector( wtmp, l,Np_); 
return 1; 

} 

void 

PKDET::debug( int Ivl ) const 
10 { 

::printf( H PKDET::debug\n"); 

::printfC'Np=%3d Nt=%3d medGap=%2d medWid=°/o2d\n", 
Np_,Nt_,medGap_,medWid_); 
if(NULL=ptr_) 
15 ::primf("PKDET::debug failed: ptr_=%p\n",ptrj; 

else if(NULL = ppkj 

::printf("PKDET::debug failed: ppk_=%p\n",ppk_); 
else if(NULL == gap_) 
::printf( H PKDET::debug failed: gap_=%p\n",gap_); 
20 else if(NULL = widj 

::printfl["PKDET::debug failed: wid_=%p\n",widj; 
else if(NULL — ins J 

::printf("PKDET::debug failed: ins_=%p\n",insj; 
else for(int idx=l; idx<=Np_; idx++) { 
25 ::printf("%3d: %4d|%4d|%4d, lgap=%2d rgap=%2d width=%2d ins=%d\n", 
idx,ptr_[idx],ppk_[idx],ptr_[idx+ 1 ],gap_[idx],gap_[idx+l ], 
wid_[idx],ins_[idx]); 
::fflush( stdout ); 
> 

30 } 
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* FILE: Pkdet.hxx 

* AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
5 */ 

#if !defined(_PKDET_HXXJ 

* define _PKDET„HXX_ 
#include <stdio.h> 

class Band 

10 { 

public: 

Band() : bJO), mJO), eJO), ins JO) {;} 

Band(int b ? int m, int e, int i) : bjb), mjm), eje), insji) {;} 

~Band(){;} 
1 5 int bgn() const { return b_; } 

int mid() const { return m_; } 

int end() const { return e_; } 

int wid() const { return e_-b_+l ; } 

int ins() const { return ins_; } 
20 void bgn( int b ) { b_ = b; } 

void mid( int m ) { m_ = m; } 

void end( int e ) { e_ = e; } 

void ins( int i ) { ins_ = i; } 

void debug( int lvl=0 ) const 
25 { 

:;printf("%d,%d,%d (inserted=%d)\n\b_,m^e_,insj; 
::fflush(stdout); 

} 

private: 

30 short b_. m_, e_, ins_; 

}; 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



class PKDET 
{ 

public: 

enum Status { STS_UNINITD, STSJNITD, STS_NO_MEM }; 
5 PKDET(); 

PKDET(PKDET const& rhs); 

PKDET const& operator=(PKDET const& rhs); 

~PKDET(); 

int set( int const* ppk ? int Np, int const* ptr, int Nt ); 
1 0 int set( Band* pb, int Nb ); 

Status statusQ const { return status^; } 

int npk() const { return Np_; } 

int ntr() const { return Ntj, } 

int const* bbgn() const { return ptr_; } 
15 int const* bmid() const { return ppk_; } 

int const* bend() const { return &ptr_[l]; } 

int ins(int idx) const { return ins_Jidx]; } 

int const* bwid() const { return wid_; } 

int bbgn(int idx) const { return ptrjidx]; } 
20 int bmid(int idx) const { return ppk_[idx]; } 

int bend(int idx) const { return ptrjl+idx]; } 

int bwid(int idx) const { return widjidx]; } 

int lgap(int idx) const { return gap_[idx]; } 

int rgap(int idx) const { return gap_[l+idx]; } 
25 Band band(int idx) const { 

Band b(bbgn(idx),bmid(idx),bend(idx),ins(idx)); 
return b; 

} 

int const* lgap() const { return gap_; } 
30 int const* rgap() const { return &gap_[l]; } 
int medGapQ const { return medGapj } 
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int medWid() const { return medWid_; } 
void debug( int lvl = 0 ) const; 
private: 
Status status_; 
5 int *ppk_, 
*ptr__, 

*gap_, 
*wid_, 
*ins_; 
10 int Np_, 

mcdGap_, 
medWid_; 
void release(); 

15 }; 

#endif 

^* *********** ******* * ****** ******************************************* 

* FILE: polfit.cxx 
20 * AUTHOR; Philip R. Bevington, p 140-141 
*/ 

#include <nrc/nr.hxx> 
#if Idefined(SA) 
int 

25 polfit( float const* px, float const* py , 

float const* sigmaY, 

intNPTS, 

int NTERMS, 

PFWGHT mode, 
30 float coefll, 

float& chisq ) 
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{ 

intNMAX, inr,jnr; 

float degfre, **soln, *sumx, *sumy, **airay; 
NMAX = 2*NTERMS- 1 ; 
sumx = ::vector(l,NMAX); 
sumy = ::vector(LNTERMS); 
soln =::matrix(l,NTERMS,l,l); 
array = : :matrix( 1 ,NTERMS, 1 ,NTERMS); 
for(inr=l ;inr<=NMAX;inr++) 
sumx[inr] = O.Of; 
for(inr=l ;inr<=NTERMS;inr++) 
sumy[inr] = O.Of; 
chisq = O.Of; 

for(inr= 1 ;inr<=NPTS ;inr++) { 
float x, y, wt, xterm, yterm; 
x = px[inr]; 
y = py[inr]; 
switch(tnode) { 
case INSTRUMENTAL: 
20 if(y<0) wt = -1.0f/y; 

else if(0==y)wt= l.Of; 
else wt=1.0f/y; 
break; 

case NO WEIGHTING: 
25 wt = 1 .Of; 

break; 

case STATISTICAL: 
wt = 1.0f/(sigmaY[inr]*sigmaY[inr]); 

break; 

30 } 

xterm = wt; 
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for(jnr=l;jnr<=NMAXynr++) { 
sumxfjnr] += xterm; 
xterm *= x; 

} 

yterm = wt*y; 

forQni=l ynr<=NTERMS;jnrH-) { 
sumyfjnr] += yterm; 
yterm *= x; 

} 

chisq += wt*y*y; 

} 

for(inr=l;inr<=NTERMS;inr++) { 
soln[inr][l] = sumyfinr]; 
forOnr=l ;jnr<=NTERMSynr-H-) 
array[inr][jnr] = sumx[inr+Jnr-l]; 

} 

::gaussj( array .NTERMS, soln,l); 
for(inr=l;inr<=NTERMS;inr-H-) { 

coeflinr] = soln[inr][l]; 

chisq -= 2.0f*soln[inr][l]*sumy[inr]; 

for(jnr=l ;jnr<=NTERMSynr++) 
chisq += soln[iru-][1]*soln[jnr][l]*surnx[inr+jnr-l]; 

} 

::free_vector(sumx,l,NMAX); 

: :free_vector(sumy , 1 .NTERMS); 

: :free_matrix(soln, 1 ,NTERMS, 1,1); 

: :free_matrix(array , 1 ,NTERMS , 1 ,NTERMS); 

if(0.0f = (degfre = float(NPTS - NTERMS))) 

return 1 ; 
chisq /= degfre; 
return 0; 
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} 

#endif 

#ifdefined(SA) 
^include <stdio.h> 
5 int 

main(int argc, char* argv[]) 

{ 

static float x[] = 
{ 

10 O.Of, 

282.0f. 338.0f. 393.0f, 460.0f, 486.0f, 
512.0f. 564.0f. 615.0f, 668.0f, 726.0f, 

780.0f, 832.0f. 886.0f, 940.0f, 993 .Of, 
1 047.0f, 1 099.0f. 1 1 52.0f,l 204 .Of, 1262.0f, 
15 131 7.0f, 1 3 70.0f, 1 424.0f, 1 476.0f, 1 532.0f, 

1 587.0f, 1 640.0f. 1 694.0f,l 746.0f, 1 80 1 .Of 

}; 

static float y[] = 

{ 

20 O.Of. 

1436.0f,1420.0f,1408.0f,1404.0f,1400.0f, 
1 38 8.0f, 1 384.0f, 1 364.0f, 1 364.0f, 1 352.0f, 

1 348.0f, 1 340.0f, 1 340.0f, 1 336.0f,l 328.0f, 
1 328.0f, 1 328.0f, 1 328.0f,l 324.0f, 1 328.0f, 
25 1328.0f,1324.0f,1324.0f,1324.0f,1324.0f, 
1320.0f,1332.0f,1332.0f,1332.0f,1328.0f 

}; 

# define NPTS (sizeof(x)/sizeof(x[Oj) - 1 ) 
float ochisq; 
30 for(int tnr=2;tnr<=6;tnr++) { 
float chisq. coef[8]; 
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(void)polfit(x,y,NULL,NPTS.tnr ? NO_WEIGHTING.coef,chisq); 
: :printfnntnr=%d chisq=%f\n",tnr,chisq); 
for(int t=l ;t<=tnr;t++) 
::printf("\tcoefI%d)=%f\n ,, ,t,coeflt]); 
5 if(2!=tnr) { 

float improv = lOO.Of ochisq/chisq; 
::printf("IMPROVED BY %7.2P/o%\n'\improv); 
if(improv< 115.0f) 
break; 

10 } 

ochisq = chisq; 

} 

} 

#endif 

15 

*********************** 

* FILE: preproc.cxx 
♦AUTHOR; Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
20 */ 

^include <basecall/mb.hxx> 
struct MINV { double v; int i; }; 
void 

Wvfm: :truvel Adjust() 
25 { 

double const EXPONENT - 1 .0/2.2; 
double CONSTANT = ::pow( 1.0/255.0, EXPONENT ); 
double minv[5]; 
int ldx, sdx; 
30 for(ldx = 1 ; Idx <= lanes(); ldx++) { 
minv[ldx]= 1000.0; 
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for(sdx = bgni(); sdx <= endi(); sdx++) { 
double v = sc_la(sdx,ldx); 
if(v<=63.0) v = v*0.2511; 
else if(v <= 127.0) v = (V0.7028 - 255.0*0.1 129); 
5 elseif(v<= 191.0) v = (v*1.245 -255.0*0.3887); 

else v = (v*1.7916- 255.0*0.7916); 

v = pow( 1 .0/v, EXPONENT ) - CONSTANT; 
sc_la_set( sdx, ldx, v ); 
if(v < minv[ldx]) minv[ldx] = v; 

10 } 

} 

for(ldx = 1 ; ldx <= lanes(); ldx++) { 
double mn = minvfldx]; 
for(sdx = bgni(); sdx <= endi(); sdx++) 
1 5 sc_la_sub( sdx.ldx, mn ); 

} 

} 

static void 

minimum* double **pm, int ROW, int COL, int N, MINV& rm ) 
20 { 

rm.v = 

pm[nn.i=ROW]lCOL]; 
for(int sdx = ROW+1 ; sdx<RO W+N; sdx-H-) 
if(pm[sdx][COL]<mi.v) 
nn.v = pm[ rm.i=sdx ][ COL ]; 

25 } 

void 

Wvfm::baseline( intN ) 
{ 

int LEN = endi()-bgni()+l , sdx, ldx; 
30 MINV minv[5]; 
double **pm; 
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pm = ::dmatrix( 1, LEN + (2*N), 1, lanes() ); 
for(ldx=l ; ldx<=lanes(); ldx++) { 
double SI = sc_la( bgni(), ldx ), 

SN = sc_la( endi(), ldx); 
for(sdx=l; sdx<=N; sdx++) { 
pra{sdx][ ldx] = Sl; 
pm[N+LEN+sdx][ ldx ] = SN; 
} 

for(sdx=l; sdx<=LEN; sdx++) 
pm[N+sdx ][ ldx ] = sc_la( bgni()+sdx-l, ldx); 

} 

for(ldx=l; ldx<=lanes(); ldx++) 
minimum( pm, 1, ldx, 2*N, minvfldx] ); 

for(ldx = 1; ldx <= lanes(); ldx++) { 
int irnr, lrnr; 
irnr = bgni(); 

for(lrnr = N+1; lrnr <= N+LEN; irnr++) { 
pm_[ irnr ][ ldx ] = pm[lmr++][ldx] - minv[ldx].v; 
if( minv[ ldx ].i < (lrnr-N) ) 
minimum( pm, lrnr-N, ldx, 2*N, rainv[ldx] ); 
else if( minv[ ldx ].v > pm[ lrnr+N-1 ][ ldx ]) 
minv[ ldx ].v = pm[ minv[ ldx ].i = lrnr+N-1 ][ ldx ]; 

} 

} 

: : free_dmatrix(pm, 1 ,LEN+2*N, 1 , lanes()); 

} 

void 

Wvfm::noZeros() 
{ 

for(int ldx = 1 ; ldx <= lanes(); ldx++) 
for(int sdx = bgni(); sdx <= endi(); sdx++) 
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ifi(0.0 = pmjsdx][ldx]) 
pm_[sdx][ldx] = DBL_EPSILON; 

} 

int 

5 Wvfm::mdynpre() 
{ 

bgnEnd(); 

ifKl != specSepO) 

return 0; 
10 if(_lX!=ibJ{ 

double **pn; 

float *x, *y, *y2; 

intr,R,s,c,k^F,N,NP; 

NP = C3X==ibJ? 3: 2; 
15 NF = (scanl()+l)/NP; 

N =(NF*NP*NP+1); 

if(NULL = (pn = ::dmatrix( l,N, l,lanes()))) { 
status_ = STS_NO_MEM; 
return 0; 
20 } 

x= ::vector(l,4); 
y = ::vector(l,4); 
y2 = ::vector(l,4); 
if('.x|| !y(| !y2){ 
25 status, = STS_NO_MEM; 

return 0; 
} 

for(c=l; c<=lanes(); C++) { 
R = (bgni()-1)*NP+1; 
30 for(r=bgni(); r<=(endi()-3); r+=NP) { 

for(s=0; s<=3;s++) { 
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x[s+l] = float(r+s); 
y[s+l] = float(sc_la(r+s,c)); 
} 

::spline( x, y, 4, y[2]-y[l], y[4]-y[3], y2); 
for(s=0; s<NP; s++) 
for(k=0; k<NP; k-f+) { 
float yout, xin; 

xin = float(r+s)+float(k)/float(NP); 
::splint( x, y, y2, 4, xin, &yout); 
pn[R++][c] = double(yout); 
} 

} 

pn[R][c] = y[4]; 

} 

: : free_vector(x, 1 ,4); 
::free_vector(y,l,4); 
: :free_vector(y2,l ,4); 

pm( pn, R, bgni()*NP-NP+l,endi()*NP-NP+l ); 

} 

return 1; 

} 

int 

Wvfm::preproc() 
{ 

if(MDYN == dsj 

if(l !=mdynpre()) 
return 0; 
if(TRUVEL==dsJ 

truvelAdjust(); 
else if(MDYN != ds J 

baseline( 50 ); 
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noZeros(); 
bgni_++; 
endi_~; 
return 1; 

5 } 

/***%***++** ******** ********************************************* 

* FILE: Quadratic.cxx 
♦AUTHOR: Andy Maries 
10 * COPYRIGHT (c) 1996, University of Utah 
*/ 

#include <stdio.h> 
^include <float.h> 
#if defined(sun) 
15 # include <ieeefp.h> 
#endif 

#ifdefmed(_WIN32) 
# define finite _fmite 
#endif 

20 #include <nrc/nr.hxx> 
int 

ilinreg( int const* px, int const* py, int N, float* coef, float* prr) 
{ 

float **m, **v, muY=0.Of; 
25 double sx, sx2, ss_about_mu=0.0, ss_due_reg=0.0, ss_about_reg=0.0; 
int idx; 

coeflO] = coefll] = coefI2] = O.Of; 
if(N<2) return 0; 

if(NULL = (m = ::matrix(1.2,l,2))) 
30 return 0; 

if(NULL = (v = ::matrix(U,l,l))) { 
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if(m) ::free_matrix(m,l,2,l,2); 

return 0; 

} 

v[l][l] = v[2][l] = 0.0f; 

sx = sx2 = 0.0; 

for(idx=l; idx<=N; idx++) { 

double x = double(px[idx]), y = double(py[idx]); 

sx+=x;sx2+=x*x; 

v[l][l]+=float(y); 

v[2][l]+= float(x*y); 

} 

muY = v[l][I]/float(N); 
m[l][l] = float(N); 
m[l][2] = m[2][lj = float(sx); 
m[2][2] = float(sx2); 
gaussj( m,2, v,l ); 
coeft0] = v[lj[l] ; 
coeftl] = v[2][lJ; 
for(idx=l;idx<=N;idx++) { 
double yh = coef[l ]*double(px[idx]) + coeffl)]; 
double y = double(py[idx]); 
ss_about_mu += (y-muY)*(y-muY); 
ss_due_reg -f= (yh-muY)*(yh-muY); 
ss_about_reg += (y-yh)*(y-yh); 
} 

if(N>2) 

coef[2] = float(ss_about_reg/double(N-2)); 
if(NULL !=prr) 

*prr = float(ss_due_reg/ss_about_mu); 
::free_matrix(m,l,2,l,2); 
::free_matrix(v,l,2,l,l); 
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return 1 ; 

} 

int 

linreg( float const* px, float const* py 7 int N, float* coef, float* prr) 
5 { 

float **m,**v,muY=0.0f; 

double sx, sx2, ss_about_mu=0.0, ss_due_reg=0.0, ss_about_reg s= 0.0; 
int idx; 

coellO] = coefl[l] » coef{2] = O.Of; 
10 if(N<2) return 0; 

if(NULL = (m = ::matrix(l,2,l,2))) 
return 0; 

if(NULL = (v = ::matrix(l,2,l,l))) { 
if(m) : :free_matrix(m, 1 ,2, 1 ,2); 
15 return 0; 
) 

v[l][l] = v[2][l] = 0.0f; 
sx = sx2 = 0.0; 
for(idx=l; idx<=N; idx-H-) { 
20 float x=px[idx], y=py[idxj; 

sx += x; sx2 += x*x; 

v[l][l]+=float(y); 

v[2][l]+=float(x*y); 

} 

25 muY = v[l][l]/float(N); 

m[l][l] = float(N); 

m[l][2] = m[2][l] = float(sx); 

m[2][2] = float(sx2); 

gaussj(m,2, v,l ); 
30 coeiI0] = v[l][l]; 

coefll] = v[2][l]; 
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for(idx=l;idx<=N;idx-H-) { 
double yh = coeftl]*double(px[idx]) + coefJO]; 
double y - double(py[idx]); 
ss_about_mu += (y-muY)*(y-muY); 
5 ssduereg += (yh-muY)*(yh-muY); 
ss_about_reg += (y-yh)*(y-yh); 
} 

if(N>2) 

coef{2] = float(ss_about_reg/double(N«2)); 
10 if(NULL != prr) 

*prr = float(ss_due_reg/ss_about_mu); 
::free_matrix(m, 1 ,2, 1 ,2); 
: :free_matrix(v, 1 ,2, 1 , 1 ); 
return 1; 

15 } 
int 

iquadratic( int const* px, int const* py, int N, float* quad) 

{ 

float **m,**v; 
20 double sx, sx2, sx3, sx4, uy; 
double sum; 
int idx; 

quad[0] = quad[l] = quad[2] = quad[3] = O.Of; 
if(N<=3) return 0; 

25 m = ::matrix(l,3,l f 3); 
if(!m) 
return 0; 

v = ::matrix(l,3,l,l); 
if(!v) { 

30 ; :free_matrix(ni, 1,3,1,3); 

return 0; 
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} 

v[l][l] = v[2][l]=v[3][l] = 0.0f; 
sx = sx2 = sx3 = sx4 = 0.0; 
for(idx=l; idx<=N; idx++) { 

double x = double(px[idx]), y = double(py[idx]); 

sx += x; sx2 += x*x; sx3 += x*x*x; sx4 += x*x*x*x; 

v[l][l]+=float(y); 

v[2][l]+= float(x*y); 

v[3][l]+=float(x*x*y); 

} 

uy = v[l][l]/float(N); 
sum = 0.0; 

for(idx=l; idx<=N; idx++) { 
double y = double(py[idx]); 
sum += (y-uy)*(y-uy); 

} 

sum/=double(N-l); 
if(0.0 = sum) 

quad[0] = float(uy); 
else { 

v[l][l] /= float(sum); 
v[2]ll]/=float(sum); 
v[3][l] /= float(sum); 

m[l][l] = float(float(N)/sum); 

m[l][2] = m[2][l] = float(sx/sum); 

m[l][3] = m[2][2] = m[3][l ] = float(sx2/sum); 

m[2][3] = m[3][2] = float(sx3/sum); 

m[3][3] = float(sx4/sum); 

gaussj( m,3,v,l); 

sum = 0.0; 

for(idx=l; idx<=N: idx++) { 
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double x = double(px[idx]), y = double(py[idx]), yprim; 
yprim = v[l][l] + v[2][l]*x + v[3][l]*x*x; 
sum += (yprim-y)*(yprim-y); 
} 

5 quad[0] = v[l][l]; 

quad[l] = v[2][l]; 
quad[2] = v[3][l]; 

quad[3] = float(sum/double(N-2-l)); 
} 

10 ::free_rnatrix(m,l,3,l,3); 
::free_matrix(v,l,3,l,l); 
return 1; 

} 

int 

15 dquadratic( double const* px, double const* py, int N, float* coef) 
{ 

float **m, **v; 

double sx, sx2, sx3, sx4, uy, sum; 
int idx; 

20 coef[0] = coefp ] = coefI2] = coeff3] = O.Of; 
if(N<=3) return 0; 
m= ::matrix(l,3,l,3); 
if(!m) 
return 0; 
25 v = ::matrix(l,3,l,l); 
if(!v) { 
: :free_matrix(m, 1,3,1,3); 
return 0; 

} 

30 v[l][l] = v[2][l] = v[3][IJ = 0.0f; 
sx = sx2 = sx3 = sx4 = 0.0; 
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for(idx=l; idx<=N; idx++) { 
double x=px[idx], y=py[idx]; 
sx +=x; 
sx2 += x*x; 
sx3 += x*x*x; 
sx4 += x*x*x*x; 
v[l][l]+= float( y); 
v[2][l]+=float(x*y); 
v[3][l]+= float( x*x*y); 
} 

uy = v[l][l]/float(N); 
sum = 0.0; 

for(idx=l; idx<=N; idx++) { 
double y = double(py[idx]); 
sum += (y-uy)*(y-uy); 
} 

sum /= double(N-l); 
if(0.0 — sum) { 
coef[0] = float(uy); 

coefll] = coef[2] = coefI3] = O.Of; 

} 

else { 

v[l][l]/=float(sum); 

v[2][l]/=float(sum); 

v[3][l]/=float(sum); 

m[l][l] = float(float(N)/sum); 

m[l][2] =m[2][l] = float(sx/sum); 

m[l][3] = m[2][2] = m[3][l] = float(sx2/sum); 

m[2][3] = m[3][2] = float(sx3/sum); 

m[3][3] = float(sx4/sum); 

gaussj( m,3, v,l); 
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sum = 0.0; 

for(idx=l; idx<=N; idx++) { 
double x = px[tdx], y = py(idx], yprim; 
yprim = v[l][l] + v[2][l]*x + v[3][l]*x*x; 
5 sum += (yprim-y)*(yprim-y); 

} 

coefI0] = float(v[l][l]); 
coefll] = float(v[2][l]); 
coefI2] = float(v[3][t]); 
10 coefI3] = float(sum/double(N-2-l)); 
} 

::frce_matrix(m, 1 ,3, 1 ,3); 
::free_matrix(v, 1 ,3 , 1 , 1 ); 
if(!finite(coefI3])) coefI3] = float(lE6); 
1 5 return 1 ; 

} 

/********************************************************************** 

* FILE: Quadratic.hxx 
20 * AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

#if !defined(_QUADRATIC_HXX_) 

* define _QUADRATIC_HXX_ 

25 void iquadratic(int const* x, int const* y, int n, float coelT4]); 

void dquadratic(double const* x, double const* y, int n, float coef[4]); 
#endif 

jit ift****** *** ************* ***************************************** 

30 * FILE: RatioBin.hxx 
♦AUTHOR: Andy Marks 
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*/ 

#include <stdio.h> 
#include <stdlib.h> 
#include <math.h> 
5 #include <basecall/RatioBin.hxx> 
#include <nrc/nr.hxx> 

RatioPattern::RatioPattern(int nfluor) : nfluorjnfluor), npatm_(0), chmrdy_(0) 
{ 

int pdx, rdx; 
1 0 for(pdx=0;pdx<MAXPTRN ;pdx-H-) { 
PATRN& rp « patrn Jpdx]; 
rp.frsiAtBand = rp.lastAtBand = 0; 
rp.nentry = rp.t3 = rp.t4 = 0; 
rp.r34 - 0.0; 
1 5 for(rdx=0;rdx<nfluor - ;rdx++) 

rp.rnsum[rdx] = rp.ratiojrdx] = O.Of; 

} 

#if 1 

PATRN* pp; 

20 pp = &patrn_[0]; pp->nentry^l ; pp->t3=3; pp->t4=l ; pp->r34=0.8f; 

pp->msum[0] = I .Of; pp->rnsum[l] = 0.5f; pp->rnsum[2] = 0.5f; pp->rnsum[3] = 0.8f; 

pp->ratio[0] = l.Of; pp->ratio[l] = 0.5f; pp->ratio[2] = 0.5f; pp->ratio[3] = 0.8f; 

pp = &patrnjl]; pp->nentry=l ; pp->t3=3; pp->t4=l ; pp->r34-0.6f; 

pp->rnsum[0] = 0.4f; pp->rnsum[l] - l.Of; pp->rnsum[2] = 0.3f; pp->rnsum[3] 0.3f; 
25 pp~>ratio[0] = 0.4f; pp->ratio[l] = l.Of; pp->ratio[2] = 0.3f; pp->ratio[3] - 0.3f; 

pp = &patrn_[2]; pp->nentry=l; pp->t3=l; pp->t4=2; pp->r34-0.3f; 

pp~>rnsum[0] = O.Of; pp->rnsum[l] = 0.3f; pp->rnsum[2] = l.Of; pp->msum[3] = O.lf; 

pp->ratio[0] = O.Of; pp->ratio[l] « 0.3f; pp->ratio[2] = l.Of; pp*>ratio[3] = O.lf; 

pp = &patrnJ3]; pp->nentry=l; pp->t3=3; pp->t4=2; pp->r34=0.8f; 
30 pp~>rnsum[OJ = 0.3f; pp->rnsum[l ] = 0.4f; pp->rnsum[2] = 1 .Of; pp->rnsum[3] = 0.8f; 

pp->ratio[0] = 0.3f; pp->ratio[I] « 0.4f; pp->ratio[2] « l.Of; pp->ratio[3] = 0.8f; 
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npatm_ = 4; 
#endif 

for(pdx=0;pdx<nfluor_;pdx++) 
for(rdx=0;rdx<nfluor_;rdx++) 
5 chm_[pdx][rdx] = (pdx=rdx)?l .0:0.0; 

} 

void 

RatioPattem::debug() const 

{ 

1 0 ::printf("P# #ent t3 t4 first last rat34 ratfl 1 ratfl2 ratfl3 ratfl4\n"); 
for(int pdx=0;pdx<MAXPTRN;pdx++) { 
PATRN const& rp = patmjjpdx]; 

::printf("%2d %4d %2d %2d %4d %4d %5.3f %6.3f %6.3f %6.3f %6.3f\n", 
pdx,rp.nentry,rp.t3,rp.t4,rp.frstAtBand,rp.lastAtBand,rp.r34. 
1 5 rp.ratio[0],rp.ratio[l ],rp.ratio[2],rp.ratio[3]); 

::printf("\n"); 

} 

} 

int 

20 RatioPattem::add( double const* obs, int bandnr ) 
{ 

double mind=4.0f; 
int minx— 1 , pdx, rdx, rv= 1 ; 
if(bandnr<=2 || npatrn_>=10) return 1; 
25 for(pdx=0;pdx<npatrn_;pdx++) { 
double eucd=0.0; 
for(rdx=0;rdx<nfluor_;rdx++) { 
double v = (patrn_[pdx].ratio[rdx]-obs[rdx]); 
eucd += v*v; 
30 } 

eucd = ::sqrt(eucd); 
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if(eucd<mind) { 
mind=eucd; 
minx=pdx; 
} 

5 } 

if((-l==minx || mind>MAXEUCD) && npatrn_<MAXPTRN) { 
PATRN& rp = patrnjminx = npatrn_++]; 
rp.frstAtBand = bandnr; 
rp.nentry-H-; 
1 0 for(rdx=0;rdx<nfluor__;rdx++) { 
rp.ratiofrdx] = obs[rdx]; 
rp.rnsumfrdx] = obsfrdx]; 

} 

mind = 0.0; 
15 } 

else if(mind<=MAXEUCD) { 
PATRN& rp = patrnjminx]; 
rp.lastAtBand = bandnr; 
rp.nentry++; 
20 for(rdx=0;rdx<nfluor_;rdx++) { 

rp.ratio[rdx] = ((FLTRX-1.0)*rp.ratio[rdx] + obs[rdx])/FLTRK; 
rp.rnsumfrdx] += obs[rdx]; 

} 

> 

25 else 

rv = 0; 
#if0 
if(l=rv) { 
PATRN& rp = patrnjminx]; 
30 ::printf(• , OBS[nn](%5.2f%5.2f%5.2f%5.2f)|%5.3q^ 
obs[0],obs[l],obs{2],obs[3],mind); 
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::printf(TAT[%2d]( 0 /o5.2f°/o5.2f%5.2f°/o5.2f)\n M , 
minx.rp.ratio[0],rp.ratio[ 1 ],rp.ratio[2],rp.ratio[3]); 

} 

#endif 
return rv; 

} 

void 

RatioPattern: :sortOnNentry_() 
{ 

for(int pdx=npatrn_-l ;pdx>0;pdx--) 
for(int tdx=0;tdx<pdx;tdx++) 
if(patmjtdx].nentry < patrn_[tdx+l].nentry) { 
PATRN tmp = patmjtdx]; 
patrnjtdx] = patrn_[tdx+l]; 
patrn_[tdx+ 1] = tmp; 
} 

} 

void 

RatioPattem::sortOnR34_() 

{ 

int pdx ? rdx, tdx; 
for(pdx=0;pdx<npatrn_;pdx-H-) { 

PATRN& rp = patrnjpdx]; 

double dtmp[4]; 

int itmp[4]; 

for(rdx=0;rdx<nfluor_;rdx++) { 
rp.ratio[rdx] = rp.rnsum[rdx]/double(rp.nentry); 
dtmp[rdx] = rp.ratiofrdx]; 
itmp[rdx] = rdx; 

} 

for(tdx=nfluor_- 1 ;tdx>0;tdx~) 
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for(rdx=0;rdx<tdx:rdx++) 
if(dtmp[rdx]>dtmp[rdx+l]) { 

double dt = dtmp[rdx]; dtmp[rdx) = dtmp[rdx+l ]; dtmp[rdx+l ] = dt; 
int it = itmp[rdx]; itmp[rdx] = itmp[rdx+l]; itmp[rdx+I] = it; 

} 

rp.r34 = (0.0=dtmp[3])? 1.0: dtmp[2]/dtmp[3]; 
rp,t4 = itmp[3]; 
rp.t3 = itmp[2]; 
} 

fo^pdx^nfluor^l ;pdx>0;pdx«) 
for(tdx=0:tdx<pdx;tdx++) 
if(patrnjtdx].r34 > patrn_[tdx+l].r34) { 
PATRN t = patm Jtdx]; 
patrnjtdx] = patm_[tdx+l]; 
patrnjtdx+l] = t; 
} 

} 

int 

RatioPattern::orderItJ) 

{ 

PATRN* ordered = new PATRNf nfluor_ ]; 
int pdx, rv = 1 ; 

for(pdx=0;pdx<nfluor_;pdx++) 

ordered [pdx).nentry = 0; 
if(NULL = ordered) 

return 0; 
for(pdx~0;pdx<nfluor_;pdx++) { 

PATRN const& rp = patrnjpdx); 

int mjr=rp.t4, mnr=rp.t3; 

if(0=ordered[mjr].nentry) ordered[mjr] = rp; 

else if(0=ordered[mnrJ.nentry) orderedfmnr] ~ rp; 
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else { 

int alt ™ ordered[mjr].t3; 
if(0=ordered[alt].nentry) { 

ordered[alt] = ordered[mjr]; 

ordered[mjr] = rp; 

} 

else { 

alt = ordered[mnr].t3; 
if(0— ordered[alt].nentry) { 
ordered[alt] = orderedfmnr]; 

orderedfmnr] = rp; 

i 
/ 

else { 
rv=0; 
goto bugout; 
} 

} 

} 

} 

for(pdx=0;pdx<nfluor_;pdx++) 
patrnjpdx] = ordered[pdx]; 
bugout: 

delete [] ordered; 
return rv; 

} 

double 

RatioPattem::chm(int rn, int cn) 

{ 

if(0=chmrdyj { 
sortOnNentryQ; 
sortOnR34_(); 
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if(l = (chmrdy_ = orderltj))) 
for(int pdx=0;pdx<nfluor_;pdx++) 
for(int tdx=0;tdx<nfluor_;tdx++) 
chm_[tdx][pdx] - patm_[pdx].ratio[tdx]; 

5 } 

return chm_[rn- 1 ] [cn- 1 ] ; 

} 

RatioBin::RatioBin( intnf ) : nfluorjnf), CHMJNULL), SST_(NULL), rpJnO 

{ 

10 if(nfluor_>l) { 

CHM_ = ::matrix(l.nfluor_,l.nfluor J; 

SST_ = ::matrix( 1 ,nfluor_, 1 ,nfluor_); 
} 

} 

1 5 RatioBin: :RatioBin( RatioBin const& rhs ) : 
nfluorJO), CHM_(NULL), SST_(NULL) 

{ 

*this = rhs; 

} . 
20 void 

RatioBin::release_() 

{ 

if(NULL != CHM J { ::free_matrix(CHM_, 1 ,nfluor_, 1 ,nfluorJ; CHM_ = NULL; } 
jffNULL != SSTJ { ::firee_matrix(SST_,l,nfluor_,l,nfluorJ; SST_ - NULL; } 

25 } 

RatioBin const& 

RatioBin::operator=( RatioBin const& rhs ) 

{ 

if(&rhs != this) { 
30 release_(); 

nfluor_ = rhs.nfluor_; 
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rp_ = rhs.rp_; 
if(nfluor_>l) { 

if(NULL — (CHM_ = ::matrix(l,nfluor^l,nfluorJ)) 
goto bugout; 

5 if(NULL = (SST_ = : :matrix( 1 5 nfluor_, 1 .nfluor J)) { 

::free_matrix(CHM__, 1 ,nfluor_,l ,nfluorJ; 
goto bugout; 

} 

for(int rdx=l;rdx<=nfluor_;rdx++) 
1 0 for(int cdx= 1 ;cdx<=nfluor_;cdx++) { 

CHM_[rdx][cdx] = rhs.CHMJrdx][cdx]; 
SST_[rdx][cdx] = rhs.SSTJrdx][cdx]; 
/ 

} 

15 } 
bugout: 

return *this; 

} 

RatioBin::-RatioBin() 
20 { 

released); 

} 

int 

RatioBin::cIassify( double const * smpl, int x ) 

25 { 

double mxv, normalized[4]; 
int idx. mxi; 
mxv = smpl[mxi=0]; 
for(idx=l ;idx<nfluor_;idx++) 
30 if(smpl[idx]>mxv) 

mxv = smpl[mxi-idx]; 
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for(idx=0;idx<nfluor_;idx++) 
normalized[idx] = smpl[idx]/mxv; 
(void)rp_.add( normalized, x ); 
return 1 ; 

5 } 
void 

RatioBin::debug(unsigned lvl) const 
{ 

int idx, cdx; 

1 0 ::printf("RatioBin @ %p\n",(void*)this); 
rp_.debug(); 

::printf("CHM @ %p:\n",(void*)CHMJ; 
if(NULL != CHMJ 
for(idx=l;idx<=nfluor_;idx++) { 

15 ::printf("\t"); 

for(cdx=l ;cdx<=nfluor_;cdx++) 
::printf("%8.5f\CHM_[idx][cdx]); 

::printfT\n"); 
} 

20 ::printf("SST @ %p:\n",(void*)SSTJ; 
if(NULL != SSTJ 
for(idx=l;idx<=nfluor_;idx++) { 

::printfi["\t"); 

for(cdx=l ;cdx<=nfluor_;cdx++) 
25 ::printf("%8.5f '\SST_[idx][cdx]); 

::printf("\n"); 
} 

} 

int 

30 RatioBin::analyze() 

{ 
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float** CNT = ::matrix(l,nfluor_,l,l); 
intidx.jdx; 

for(idx=l;idx<=nfluor_;idx-H-) { 
CNT[idx][l]= l.Of; 
5 for(jdx=lydx<=nfluor_;jdx++) 

CHM_[idx][jdx] = float(rp_.chm(idx,jdx)); 

> 

for(jdx=l ;jdx<=nfluor_;jdx++) 
for(idx=l ;idx<=nfluor_;idx-H-) 
10 SST_[idx][jdx] = CHM_[idx][jdx]; 

::gaussj( SST_, nfluor_. CNT, 1 ); 
#if 1 

for(idx=l;idx<=nfluor_;idx++) { 
intmxi=l; 
1 5 float mxv=SST_[mxi][idx]; 

for(j dx=2 ;j dx<=nfl uor_ J dx++) 
if(SST_[jdx][idx]>mxv) { 
mxv = SST_[jdx][idx]; 
mxi = jdx; 
20 } 

if(idx!=mxi) { 

::fprintf(stderr,"fluor(%d) has max SST value of %f for fluor(%d)\n",idx,mxv,mxi); 
SST_[l][l]=1.0f; SST_[l][2]=0.4f; SST_[l][3]=0.0f; SST_[l][4]=0.3f; 
SST_[2][l]=0.5f; SST_[2][2]=1.0f; SST_[2][3]=0.3f; SST_[2][4]=0.4f; 
25 SST_[3][l]=0.5f; SST_[3][2]=0.3f; SST_[3][3]=1 .Of; SST_[3][4]=1 .Of; 

SST_[4][l]=0.8f; SST_[4][2]=0.3f; SST_[4][3]=0.1f; SST_[4][4]=0.8f; 

::gaussj( SST_. nfluor_, CNT, 1 ); 

break; 

} 

30 } 
#endif 
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: :free„matrix( CNT, 1 ,nfluor_. 1,1); 
return 1; 

} 

* FILE: RatioBin.hxx 
♦AUTHOR: Andy Marks 

* Copyright © 1996 University of Utah 
*/ 

1 0 #if !defined(_RATIOBIN_HXX J 
#define RATI OB IN_HXX_ 
static int const MAXPTRN = 20; 
static double const FLTRK = 1 2.0; 
static double const MAXEUCD = 0.50; 

1 5 class RatioPattem 
{ 

public: 

RatioPattem( int nfluor=4 ); 
-RatioPattern() {}; 
20 int add(double const* obs, int bandNr); 
double chm(int rdx, int cdx); 
void debug() const; 
private: 
int nfluorj 
25 int npatrnj 
int chmrdy_; 
struct PATRN { 
int nentry; 
double ratio[4]; 
30 double rnsum[4]; 
double r34; 
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short t3, 
t4, 

frstAtBand, 
lastAtBand; 
5 } patrnJMAXPTRN]; 
double chm_[4][4]; 
void sortOnNentry_(); 
void sortOnR34 J); 
int orderIt_(); 
10 }; 

class RatioBin 

{ 

public: 

RatioBin( int nsmpl=4 ); 
1 5 RatioBin( RatioBin const& rhs ); 

RatioBin const& operator^ RatioBin const& rhs); 
-RatioBin(); 

int classify( double const* smpls, int scanline ); 
int analyze(); 

20 double sst(short row, short col) const { return SSTJrow][col]; } 

void debug(unsigned lvl=0) const; 
private: 

void release_(); 

int nfluor 
25 float** CHM_; 

float** SST_; 

RatioPattern rp_; 

}; 

#endif 

30 

f************^^t**********************^**^1H,*tt**** ** *********** ** 
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* FILE: RdrOutxxx 
♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

5 #include <basecall/mb.hxx> 
#inciude <nrc/CompIex.hxx> 
#include <basecall/sw.hxx> 
SSNODE::SSNODE() : 

fltwidJO), startJO), finishJO), SWoIdJO), threshJO.Of) 

10 { 

} 

void 

SSNODE::debug() const 
{ 

1 5 ::printf("SSNODE @ %p\n",(void*)this); 

::printf(" fltwid = %4d\n".fltwid()); 

::printf(" SWold = %4d\n",SWold()); 

::printf(" start = %4d\n\start()); 

::printf(" finish = %4d\n",finish()); 
20 ::printf(" rdlen = %4d\n",rdlen()); 

::printf(" thresh = %4.2An",thresh()); 

} 

QualCtrl::QualCtrl() : 

tbgnjO), tend_(0), nsegJO), skipseq_(NULL) 

25 { 

for(int idx=0;idx<MAXSEG;idx-)-+) 
bspacfidx] = fbwv_[idx] = 0; 

} 

char const* 

30 QualCtrl::shft( int idxO, char* buf. int buflen ) const 

{ 
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if((buflen>=15) && (idxO<nseg())) { 
ShftVect s = shft(idxO); 

::sprintf( buf, "p/olhd^hd^/oZhd^/oZhd]", s.s(l ),s.s(2)^s(3),s.s(4)); 
} 

5 else 

::strcpy( buf, "? H ); 
return (char const* )buf; 

} 

QualCtrl::QualCtrl(QualCtrl const& rhs) : 
1 0 tbgnJO), tend JO), nsegJO), skipseqJNULL) 
{ 

*this = rhs; 

} 

QualCtrl const& 
1 5 QualCtrl::opcrator=(QualCtrl const& rhs) 

{ 

if(this!=&rhs){ 

for(int idx=0;idx<MAXSEG;idx++) { 
shftjidx] = rhs.shftjidx]; 
20 fbwv_[idx] = rhs.fbwvjidx]; 

bspac_[idx] = rhs.bspac_[idx]; 

} 

tbgn_ = rhs.tbgn_; 

tend_ = rhs.tend_; 
25 nseg_ = rhs.nsegj 

cutdata_ = rhs.cutdataj 

if(skipseq_) delete [] skipsecL; 

skipseq_ = new char[ t-f ::strlen(rhs.skipseq_) J; 

ifl(skipseq_) ::strcpy( skipseq_, rhs.skipseq_ ); 
30 } 

return *this; 
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} 

QualCtrl::~QualCtrl() 
{ 

if(skipseqj { 
5 delete [] skipseq_; 
skipseq_ = NULL; 

} 

} 

void 

10 QualCtrl::debug() const 
{ 

::printf("QualCtrl @ %p\n",(void const*)this); 
::printf(" elapsed=%d(Sec)\n", tend_-tbgn_); 
cutdata_.debug(); 
15 for(int idx=0;idx<nseg_;idx++) { 

::printf(" idx=%d: fbw=%2d, bspac=%2d\t",idx ; fbwv_[idxl,bspac_[idx]); 

shft_[idx].debug(); 

} 

} 

20 RdrOut::RdrOut( Wvfm const& in ) : 

overlayldxjl), oiSlJ in.bgni()+l ), oPosLenJO), oPos_(NULL), previjl) 

{ 

::strcpy( lnordr_, in.lnordr() ); 
for(int idx=l ;idx<=OVRLAP;idx++) 
25 FM_[idx] = (-1 .0/double(OVRLAP-l))*idx+(double(OVRLAP)/double(OVRLAP-l)); 

} 

RdrOut::~RdrOut() 
{ 

if(NULL != oPosJ { 
30 ::free_ivector( oPos_, l,oPosLen_); 

oPos_ = NULL; 
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oPosLen_ = 0; 

} 

} 

int 

5 RdrOut;:add( int passnr, int fbv/, int medSP, SegRead const& sr ) 

{ 

int rv; 

qualctri_.shft( passnr- 1, sr.nsv() ); 
qualctrl_,fbw( passnr- 1, fbw ); 
1 0 qualctr]_.bspac( passnr- 1 , medSP ); 
qualctrl_.nseg( passnr); 
if(l = passnr) { 
bandStats_ = sr.bandStats(); 
traceOut_ = *sr.wvfm(); 
15 rv=l; 

} 

else if(l = (rv = was_at( sr, passnr ))) 
join( sr ); 
iffrv) { 

20 if(NULL != oPosJ 

::freejvector(oPos_, l,oPosLen_); 
oPosLen_ = sr.bandStats(),len(); 

if((0=oPosLenJ || (NULL = (oPosj=::ivector(l,oPosLenJ))) 
rv = 0; 

25 else for(int idx= 1 :idx<=oPosLen_;idx++) 
oPosJidx] - sr.bandStats().posn(idx-l); 

} 

return rv; 

} 

30 void 

RdrOut::join( SegRead const& sr ) 
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{ 

int lhsEndO, rhsBgnO.oscnhnscnl. IShortFalLrShortFall; 
lhsEndO = bandStats_.posn(overlayIdx_-l) + 1 - OVRLAP/2; 
rhsBgnO = sr.bandStats().posn(previ_-l) + 1 - OVRLAP/2; 
5 IShortFall = OVRLAP-(wvfm().rows()-lhsEndO+l ); 
rShortFall= 1-rhsBgnO; 
if(IShortFall > 0) { 

lhsEndO -= IShortFall; 

rhsBgnO -= IShortFall; 
10 } 

else if(rShortFall > 0) { 

lhsEndO += rShortFall; 

rhsBgnO += rShortFall; 

} 

1 5 for(int idx=l ; idx<=OVRLAP; idx++) { 

double osf = FMJidxl, nsf = FM_[OVRLAP-idx+l]; 
oscnl = IhsEndO+idx-l; 
nscnl = rhsBgnO+tdx-1 ; 
for(int lane=l ;lane<=4;lane++) { 
20 double ov, nv; 

ov = wvfm().sc_la( oscnl, lane ); 

nv = sr.wvfrn()->sc_la( nscnl, lane ); 

wvfin().sc_la_set( oscnllane, osf*ov + nsfnv ); 

} 

25 } 

wvfin().append( *sr.wvfm(), oscnl, nscnl+1 ); 
bandstat().append( sr.bandStats(),overlayIdx_-l,previ_ ); 

} 

void 

30 RdrOut::closesTo(int wasAt ? int& dist, int& j) const 
{ 
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int lodx=l, hidx=oPosLen_, tsti, tstp, htsti, ltsti; 
Jtsti = lodx; 
htsti = hidx; 
while(lodx <= hidx) { 
5 tstp = oPos_[ tsti = (hidx+lodx)/2 J; 

if(wasAt == tstp) { 

dist = 0; 

j = tsti; 

return; 
10 } 

else if(wasAt < tstp) { 
htsti = tsti--; 
hidx = tsti; 
} 

15 else { 

ltsti = tsti-H- ; 
lodx = tsti; 
} 

} 

20 int dlo = abs( wasAt - oPosJltsti] ), 
dhi = abs( oPosJhtsti] - wasAt ); 
if(dlo < dhi) { dist = dlo; j = ltsti; } 
else { dist = dhi; j= htsti;} 

} 

25 int 

RdrOut::was_at( SegRead const«& sr, int n ) 
{ 

static int const CHKORDR[] = {0,5,6,4,7,3,8,2,9,1,10}; 
# define NCHK ((si2eof(CHKORDR)/sizeof(CHKORDR[0]))-l) 
30 int mindist=4, xlation[3], cdx, K; 
intNband = sr.bandStats().len(): 
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int rv = 0; 

K = sr.iSl() - (oiSl_ + (n-2)*1900); 
for(cdx=l; cdx<=NCHK; cdx++) { 
int Cdx = CHKORDR[ cdx ]; 
5 for(int lane=0;lane<4;lane++) { 

if((Cdx<=Nband)&& (lnordr_[lane] = sr.bandStats().call(Cdx-l))) 
break; 

} 

if(5 != ++lane) { 
10 int wasAt, distj; 

wasAt = (sr.bandStats().posn(Cdx-l) - qualctrl().shft(n-l).s(lane)) + 

(K + qualctrl().shft(n-2).s(lane)); 
closesTo( wasAt. distj ); 
if(dist < mindist) { 
15 rv=l; 

xlation[ 1 ] = Cdx; 
xlation[2] = j; 
if(0 = (mindist=dist)) 
break; 

20 } 

} 

} 

if(l=rv){ 
overlayIdx_ += (xlation[2]-previ_); 
25 previ_ = xlation[l]; 

} 

return rv; 

} 

void 

30 RdrOut::Edit( char const* preamble ) 

{ 
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int Icutscnl, rcutscnl, NB; 
NB = bandStats_.len(); 
Icutscnl = bandStats.posn(O); 
rcutscnl = bandStats_.posn(NB-l); 
5 bandqual(); 

pickcuts( preamble ); 
qualctrl_.stopTimer(); 

} 

static int const FW[] = { 7, 9, 1 1, 13, 15 }; 
1 0 static float const TH[] = 

{ 0.74f. 0.75f, 0.76f, 0.77f, 0.78f, 0.79f, 0.80f }; 
static int const NW = (sizeof(FW)/sizeof(FW[0])), 
NT = (sizeof(TH)/sizeof(TH[0]»; 

void 

1 5 RdrOut::cutoff(int fdx, SSNODE& ss, int FFTSZ) const 

{ 

int NB = bandStats_.len(); 
int runlen=0, mxrl=0; 
ss.fltwid( FW[ fdx ] ); 
20 if(ss.fltwid() <= FFTSZ) { 

int LHS = (l+FW[fdx])/2, RHS = FW[fdx]-LHS; 
int anyGood=0, anyL2H=0, anyH2L=0; 
int idx, jdx, good, goodMl=0; 
for(idx=l;idx<=2*LHS;idx+=2) { 
25 HF_[ idx ] = 1 .0/double(ss.fltwid()); 

HF_[idx+l] = 0.0; 
} 

for(;idx<=2*(FFTSZ-RHS);idx++) 
HF_[ idx ] = 0.0; 
30 for(;idx<=2*FFTSZ;idx+=2) { 

HF_[ idx ] = 1 .0/double(ss.fltwid()); 
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HF_[idx+l] = 0.0; 
} 

::dfourl( HF_, FFTSZ, 1); 
::dCMul( XF_, HF_, AF_, FFTSZ ); 
5 : :dCMul( TF_, HF_, BF_, FFTSZ ); 
::dfourl(AF_, FFTSZ, -1); 
::dfourl(BF_, FFTSZ, -1); 
for(jdx=idx=l ydx<=2*NB;idx++,jdx+=2) { 
if(good = (AF_[jdx] >= BF_[jdx])) 
10 anyGood++; 
if(l !=idx) { 
int dg=good-goodMl ; 
if(dg>0)nL2HJ++anyL2H] = idx-1; 
else if(dg<0) nH2L_[++anyH2L] = idx- 1 ; 
15 } 

goodMl =good; 

} 

if(anyGood) { 
if(anyGood = NB) { 
20 ss.start( 1 ); 

ss.finish( NB ); 

} 

else if(0=anyL2H) { 
ss.start( 1 ); 
25 ss.finish( nH2L_[l] ); 

} 

else if(0=anyH2L) { 

ss.start( l+nL2H_[l] ); 

ss.finish(NB ); 
30 } 

else { 
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int *mL2H, *mH2L, mLn=anyL2H, mHn=anyH2L, lastH2L. lastL2H; 
mL2H = ::ivector(l,anyL2H+l); 
mH2L = ::ivector(l,anyH2L+l); 
lastH2L = nH2L_[ mHn J; 
lastL2H » nL2H_[ mLn ]; 
if(nH2L_[l]<nL2H_[l]) { 
mL2H[l]= 1; 

for(idx= 1 ;idx<=mLn;idx-H-) mL2H[idx+ 1 ] = nL2H_[idx]; 
mLn++; 

} 

else for(idx=l :idx<=anyL2II;idx++) mL2H[idx] = nL2H_[idx] ; 
for(idx=l;idx<=mHn;idx++) mH2L[idx] = nH2L_[idx]; 
if(lastH2L < lastL2H) 
mH2L[++mHn] = NB; 
if(mHn == mLn) { 
for(idx=l;idx<=mHn;idx++) { 
runlen = mH2L[idx]-mL2Hfidx]; 
if(runlen > mxrl) { 
mxrl = runlen; 
ss.start( mL2H[idx] ); 
ss.finish( mH2L[idx] ); 
} 

} 

::free_ivector(mL2H, l,anyL2H+l ); 
::free_ivector(raH2L, l,anyH2L+l ); 

} 

} 

} 

} 

} 

void 
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RdrOut::pickcuts( char const* preamble ) 

{ 

static float const WGHT[6][9) = 

{ 

5 {0.8944f,O.9076f,0.9208f,0.9340f,0.9472f,0.9604f,0.9736f,0.9868f,1.0000f}, 
{0.8755f,O.8885f,0.901 4f,0.9 1 43f,0.9272f,0.940 1 f,0.9530f,0.9660f.0.9789f} , 
{0.8567f,0.8693f,0.8819f,0.8946f,0.9072f,0.9199f,0.9325f,0.9451f,0.9578f}, 
{0.8378f,O.8501f,0.8625f,0.8749f,0.8872f,0.8996f,0.9119f,0.9243f,0.9367f}, 
{0.8189f,0.8310f,0.8430f,0.8551f,0.8672f,0.8793f,0.8914f,0.9035f,0.9155f}, 
10 {O.8000f,O.8118f,0.8236f,0.8354f,0.8472f ? 0.8590f,O.8708f,0.8826f.0.8944f} 

}; 

int jdx. idx, FFTSZ, NB = bandStats_.len(), mxrdlen=0; 
FFTSZ = int( ::pow( 2.0, ceil(::log(double(NB))/::log(2.0)) ) ); 
XF_ = ::dvector( 1, FFTSZ*2 ); 
15 TF_= : :dvector( 1 , FFTSZ*2 ); 
HF_= ::dvector( 1, FFTSZ*2 ); 
AF_= ::dvector( 1, FFTSZ*2 ); 
BF_= ::dvector( 1, FFTSZ*2 ); 
nH2L_= ::ivector( 1,NB ); 
20 nL2H_ = ::ivector( 1 ,NB ); 

for(jdx=idx=l ;idx<2*NBydx++,idx+=2) { 
XF_[ idx ] = double( bandStats_.qual(jdx-l) ); 
XF_[idx+l ] = 0.0; 
} 

25 for(; idx<=2*FFTSZ; idx++) 
XF_[ idx ] = 0.0; 
::dfourl(XF_, FFTSZ, 1); 
for(int tdx=0;tdx<NT;tdx++) { 
for(idx=l:idx<2*NB;idx+=2) { 
30 TF_[ idx ] = doublet THftdx] ); 

TF_[ idx+1 ] = 0.0; 
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} 

for(; idx<=2*FFTSZ; idx++) TF_[ idx | = 0.0; 
::dfourl(TF_,FFTSZ, 1); 
for(int fdx=0;fdx<NW;fdx++) { 
5 SSNODE ss; 

ss.thresh( TH[tdx] ); 
cutoff( fdx, ss, FFTSZ ); 
int wt_rdlen = int(WGHT[fdx][tdx] * ss.rdlen()); 
if(mxrdlen < wtrdlen) { 
1 0 mxrdlen = wt_rdlen; 

qualctrl_.cutdata( ss ); 
} 

} 

} 

1 5 ::free_dvector( XF_, 1 ,2*FFTSZ ); 
::free_dvector( HF_, 1,2*FFTSZ ); 
::free_dvector( TF_, 1,2+FFTSZ ); 
::free_dvector( AF_, 1,2*FFTSZ ); 
::free_dvector( BF_, 1,2*FFTSZ ); 
20 : :free_i vector( nH2L_, 1 ,NB ); 
::free_ivector( nL2H_. 1,NB ); 
if(NULL != preamble) { 
int lpre2x = ::strlen(preamble) * 2; 
int lobs = bandstat().len(); 
25 int len = (lpre2x<lobs)? Ipre2x: lobs; 
if(qualctrl().cutdata().start() < len) { 
char* obscopy = new char[ len+1 ]; 
if(NULL != obscopy) { 
static int const MAGIC_NR = 4; 
30 int idx, s; 

for(idx=0;idx<len;idx++) 
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obscopy[idx] = bandstat().call(idx); 
obscopy[idx] = '\0'; 
SW swalign( preamble, obscopy ); 
s = qualctrl().cutdata().start(); 
5 if(swalign.hcoord() > s) { 

int hslen = ::strlen(swalign.hout()); 
if(hslen>=MAGIC_NR) { 
int hitval = (swalign.score()*swalign.score())/hsIen; 
if(hitval>MAGIC_NR) { 
1 0 qualctrl().cutdata().S Wold( s ); 

qualctrl().cutdata().start( swalign.hcoord()+l ); 

} 

} 

} 

1 5 delete [] obscopy; 

} 

} 

} 

} 

20 void 

RdrOut::Beautify(int fbool.int bbool) 

{ 

if(l=fbool) flattenj); 
ifO=bbool) minNegSwingJ); 

25 } 
void 

RdrOut::flatten_() 
{ 

int idx, jdx, kdx, N, FFTSZ; 
30 double *sn, *hn; 

N = traceOut_.endi()-traceOut_.bgni()+l; 
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FFTSZ = int(::pow(2.0,::ceil(::log(double(N+129-l))/::log(2.0)»); 
sn = ::dvector(l,2*FFTSZ); 
hn = ::dvector(l,2*FFTSZ); 
for(idx=l; idx<=2*FFTSZ; idx^) 
5 hn[idx] « sn[idx] = 0.0; 

for(jdx= 1 ,idx=traceOut_.bgni(); jdx<=2*N; idx++, jdx+=2) 
sn[jdx] = traceOut_.enw(idx); 

for(kdx=2*FFTSZ-l jdx=idx=l; idx<=128; idx++jdx+=2.kdx-=2) 
hn[jdx] = hn[kdx] = 1.0/129.0; 
10 hn[jdx] = 1.0/129.0; 

::dfourl(sn, FFTSZ, 1 ); 
::dfourl(hn, FFTSZ, I ); 
::dCMul( sn, hn, sn, FFTSZ ); 
::dfourl(sn, FFTSZ, -1 ); 
15 for(kdx=l,idx=traceOut_.bgni(); idx<=traceOut_.cndi(); idx++,kdx+=2) { 
double sf = double(FFTSZ)/(DBL_EPSILON+sn[kdx]); 
for(jdx=l ;jdx<=4ydx++) 
traceOut_.sc_la_mul(idxjdx,sf); 

} 

20 ::free_dvector(sn,l,2*FFTSZ); 
: :frce_dvector(hn, 1 ,2*FFTSZ); 

} 

void 

RdrOut::minNegSwing_() 

25 { 

for(int lnr=l;lnr<=traceOut_.lanes();lnrH-) { 
double rainv = 0.0; 

int bgn=traceOut_.bgni(), end=traceOut_.endi(); 
for(int snr=bgn; snr<=end; snr++) { 
30 double v = traceOut_.sc_la(snr,lnr); 

if(v < minv) minv = v; 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



167 

} 

if(0.0 > minv) { 
double sf = -0.02/minv; 
for(snr=bgn;snr<=end;snr-H-) 
5 if(traceOut_.sc Ja(snr,lnr) < 0.0) 

traceOut_.sc_la_mul(snr,lnr,sf); 

} 

} 

} 

10 int 

RdrOut::avgqual() const 

{ 

float aq = O.Of; 

int bgn = qualctrl().cutdata().start(), 
1 5 end = qualctrl().cutdata().finish(); 

BandStatArray const& bs = bandstat(); 
for(int idx=bgn;idx<end;idx-H-) 
aq += int(100.0f * bs.qual(idx)); 
return int(0.5f + aq/float(end-bgn+l)); 

20 } 
int 

RdrOut;:percentN() const 
{ 

BandStatArray const& bs = bandstat(); 
25 int bgn = qualctrl(),cutdata().start(), 
end = qualctrl().cutdata().finish(), 
ambig = 0; 
for(int idx=bgn;idx<end;idx++) 
if(lnordrJ4] = bs.call(idx)) 
30 ambig++; 

return int(0.5f + 100.0f*float(ambig)/float(end-bgn+l)); 
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} 

char const* 

RdrOut::sequence( char* buf, int buflen ) const 

{ 

5 BandStatArray const& bs = bandstat(); 

int len = (bs.len()<buflen)? bs.!en(): buflen; 
for(int idx=0;idx<len;idx++) 
buf[idx] = bs.call(idx); 
bufpdx] = '\0'; 
1 0 return (char const *)buf; 
} 

void 

RdrOut::debug() const 
{ 

1 5 ::printf( M RdrOut at %p\n H ,(void const*)this); 
::printf( M oiSl = %d\n",iSl()); 

::printf(" average quality = %2d, %%ambig = %2d\n H ,avgqual(),percentN()); 
qualctrl_.debug(); 
bandStats_.debug(); 
20 traceOut_.debug(); 
} 

* FILE: RdrOut.hxx 
25 * AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

#ifndef_RDROUT_HXX_ 
#define _RDROUT_HXX_ 
30 #include <time.h> 

#include <basecall/Metrics.hxx> 
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static int const MAXSEG - 6; 
static int const OVRLAP = 20; 
#ifdefined(WIN32) 
class _declspec( dllexport ) SSNODE 
5 #else 

class SSNODE 

#endif 

{ 

public: 
10 SSNODE0; 

-ssnodeo {;} 

int fltwid() const { return fltwid_; } 
int start() const { return start_; } 
int finish() const { return finish_; } 
1 5 int rdlen() const { return finish_-start_+ 1 ; } 
int SWold() const { return SWold_; } 
float thresh() const { return thresh_; } 
void fltwid(int w) { fltwid_ = w; } 
void start(int s) { start_ = s; } 
20 void finish(int f) { finish_ = f; } 

void SWold(int s) { SWold__ = s; } 
void thresh(float t) { thresh_ « t; } 
void debug() const; 
private: 
25 int fltwid_, 
start_, 
finish, 
SWold_; 
float thresh ; 

30 }; 

#ifdefined( WIN32) 
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class _declspec( dllexport) QualCtrl 
#else 

class QualCtrl 
#endif 
5 { 
public: 
QualCtrl(); 

QualCtrl( QualCtrl const& rhs ); 
QualCtrl const& operator=( QualCtrl const& rhs ); 
10 ~QualCtrl(); 

void shft( int idxO, ShftVect const& sv ) 
{ if(idxO<MAXSEG) shftJidxO] = sv; } 
void fbw( int idxO, int val ) 
{ if(idxO<MAXSEG) fbwvJidxO] = val; } 
1 5 void bspac( int idxO, int val ) 

{ if(idxO<MAXSEG) bspacJidxO] = val; } 
void nseg( int v ) { nseg_ = v; } 
void cutdata( SSNODE const& v ) { cutdata_ « v; } 
void ignoredSeq( char const* seq ); 
!0 void startTimer() { tbgn_ = time((timej*)NULL); } 

void stopTimer() { tend_ = time((time_t*)NULL); } 

int nseg() const { return nseg_; } 

SSNODE& cutdata() { return cutdata_; } 
SSNODE const& cutdata() const { return cutdata_; } 
5 char const* ignoredSeq() const { return NULL; } 

char const* shft( int idxO, char* buf, int buflen ) const; 
ShftVect shft( int idxO ) const 

{ return (idxO<nseg())? shftJidxO]: shftJO]; } 
int fbw( intidxO) const 
0 { return (idxO<MAXSEG)? fbwv_[idxO]: - 1 ; } 
int bspac( int idxO ) const 
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{ return (idxO<MAXSEG)? bspacJidxO]: -1 ; } 
time t runtimeO const { return tend -tbgn_+l ; } 

void debug() const; 
private: 

5 ShftVect shftJMAXSEG]; 
int fbwvJMAXSEG]; 
int bspacJMAXSEG]; 
time_t tbgn_, 
tend_; 
10 int nseg_; 

SSNODE cutdata_; 
char* skipseq_; 

}; 

#ifdefined(WIN32) 
1 5 class _declspec( dllexport) RdrOut 
#else 

class RdrOut 
#endif 

{ 

20 public: 

RdrOut( Wvfm const& in ); 
~RdrOut(); 

voidiSl(intiSl){oiSl_ = iSl;} 

int add( int passnr, int fbw, int medSP, SegRead const& sr ); 
25 int iSl() const { return oiSlj } 

int avgqual() const; 
int percentN() const; 

BandStatArray& bandstat() { return bandStatsj } 
BandStatArray const& bandstat() const { return bandStats_; } 
30 char const* sequence(char* buffer,int buflen) const; 

Wvfm& wvfm() { return traceOut_; } 
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Wvfm const& wvfm() const { return traceOut_; } 
QualCtrl& qualctrl() { return qualctrl_; } 
QualCtrl const& qualctrl() const { return qualctrlj } 
void Edit( char const* preamble = NULL ); 
void Beautify( int fbool, int bbool); 
void debug() const; 
private: 

int overlayIdx_, 
oiSl_, 
oPosLen_, 
*oPos_, 
previ_; 
char lnordr_[6]; 
BandStatArray bandStats_; 
Wvfm traceOut_; 
QualCtrl qualctrl_; 
double FMJ1+OVRLAP]; 
double *XF_, 
*TF^ 
*HF_, 
*AF_, 
*BF_; 
int *nH2L_ > 
*nL2H_; 

int was_at(SegRead const& sr, int passnr); 
void closesTo(int wasAt, int& dist, int& j) const; 
void join( SegRead const& sr ); 
void bandqual(); 

void pickcuts( char const* preamble ); 

void cutoff( int fdx, SSNODE& ss, int FFTSZ ) const; 

void flatten_(); 
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void minNegSwing_(); 

}; 

#endif 

^ ****************************************************** 

* FILE: SegReadxxx 
♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

10 #include <basecaWmb.hxx> 

SegRead. :SegRead( int iSl, int fluor ) : 
statusJSTSJNITD), iSl_( iSl ), nWvfJNULL), pmb_(NULL) 

{ 

pmb_ = new MB(fluor); 

15 } 

SegRead: :~SegRead() 

{ 

if(nWvf J { delete nWvf_; nWvf_ = NULL; } 
if(pmb_) { delete pmb_; pmb_ = NULL; } 

20 } 

SegRead::SegRead( SegRead const& rhs ) : 
status_(STS_UNINITD), iSl_(0), nWvf_(NULL), pmbJNULL) 

{ 

*this - rhs; 

25 } 

SegRead const& 

SegRead: :operator=( SegRead const& rhs ) 
{ 

if(this !=&rhs){ 
30 if(nWvfJ { delete nWvf_; nWvf_ = NULL; } 
if(pmbj { delete pmb_; pmb_ = NULL; } 
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nWvf_ - new Wvfm( *rhs.nWvf_ ); 
pmb_ = new MB( *rhs.pmb_ ); 
iSI_ =rhs.iSl_; 
nsv_ = rhs.nsv_; 
5 bsa_ = rhs.bsa_; 

status_ = rhs.status_; 
} 

return "this; 

} 

10 void 

SegRead::wvfm( Wvfm const& rhs ) 
{ 

if(NULL != nWvfJ { delete nWvf_; n Wvf_ = NULL; } 
nWvf_ = new Wvfm( rhs ); 

15 } 

float* 

SegRead::xbndara_( PKDET const& pks ) const 
{ 

int idx. N = pks.npk(); 
20 float* xbnd; 

if(NULL != (xbnd = ::vector(l,N))) 
for(idx=l; idx<=N; idx++) 

xbndf idx J = float(nWvf_->xbnd( pks.bmid( idx ) )); 
return xbnd; 

25 } 

float* 

SegRead-buzzaraJ PKDET const& pks ) const 

{ 

int idx. N = pks.npk(); 
30 float* buzz; 

if(NULL != (buzz = ::vector( 1 ,N))) 
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for(idx=l; idx<=N; idx++) 
buzz[ idx ] = float(nWvf_->buzz( pks.bmid( idx ) )); 
return buzz; 

} 

int 

SegRead::peakdet( int npts, PKDET& putative ) const 

{ 

double vml, v; 

int *ppk=NULL,Np=0, *ptr=NULL,Nt=0, idx; 
Wvfm const& wvfrn = *nWvf_; 
short MAXSV = nsv_.maxshft(); 
vml = wvfm.envv( MAXSV+1 ); 

v = wvfm.enw( MAXSV+2 ); 
ppk = ::ivector( 1 , npts ); 
ptr = ::ivector( 1 , npts ); 
if(NULL == ppk || NULL = ptr) 

return 0; 

enum STATE { ST_UK, ST_UP, ST DN } st = ST_UK; 
for(idx = MAXSV+2; idx <= npts; idx++ ) { 
switch(st) { 
case STUK: 
if(v>vml) st=ST_UP; 
else if(v < vml ) st=ST_DN; 
break; 
case ST_UP: 
if(v < vml) { st = ST_DN; ppk[ ++Np ] = idx-1 ; } 
break; 
case ST_DN: 
if(v > vml) { st = ST_UP; ptr[ ++Nt ] = idx-1 ; } 
break; 

} 
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vml = v; 

v = wvfm.enw( idx+1 ); 

} 

putative.set( ppk, Np, ptr, Nt ); 
5 ::free_ivector( ppk, l,npts ); 
::freeJvector( ptr, l,npts ); 
return 1; 

} 

void 

1 0 SegRead::maxlanecode_( int have, int* bcodes ) const 
{ 

short MAXSV = nsv().maxshft(); 
for(int sdx=MAXSV+l; sdx<=int(have); sdx++) { 
double THR = 0.80*nWvf_->envv(sdx); 
15 int ldx, code = 0; 

for(ldx=l ; 5!=code && ldx<=int(nWvf_->lanes()); ldx++) 
if(nWvf_->sc_la(sdx,ldx) >= THR) 
code = (0=code)?ldx:5; 
bcodes [ sdx ] = code; 
20 } 

} 

static void 

dfliplr(doubie* p, int n) 

{ 

25 int idx, ndx; 
double t; 

for(idx= 1 ,ndx=n;idx<=(n/2);idx-H-,ndx-) 
{ t = p[idx]; p[idx] = p[ndx]; p[ndx] = t; } 

} 

30 int 

SegRead::setBandStats(PKDET const& final ? int FBW,char const* seqant pass2) 
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{ 

int NPK = final.npk(), *insSP=NULL, *insWD=NULL; 
int rv = 1 ; 

double f_sigma = (double(FBW-l)*2.0*NR_PI)/2048.0; 
double t_sigma = 1 .0/f_sigma; 
int M = (final.medWidO + l)/2; 
float PATTCOEF[4], *xbnd, *buzz; 
if(l = pass2) { 

double *PX = ::dvector(l ,M), *PY = ::dvector(l ,M); 
int sts; 

if(!PX || !PY) return 0; 
for(int pdx=l;pdx<=M;pdx++) { 
double v; 

PX[pdx] = double(pdx); 

v = (PX[pdx]-((double(M)+1.0)/2.0))/(t_sigma/2.0); 

PYfpdx] = ::exp(-0.5*v*v); 

} 

sts = ::dquadratic( PX,PY, M, PATTCOEF ); 

::free_dvector(PX, 1 ,M); 

: :free_dvector(PY, 1 ,M); 

if(l != sts) return 0; 

} 

iffNULL = (insSP = ::ivector( 1, NPK ))) { 
status_ = STS_NO_MEM; 
return 0; 
} 

if(l != ::insMetric( final.bmid(), final.lgapO, insSP, NPK )) { 
status. = STS_TOO_FEW; 
::ftee_ivector(insSP, 1 ,NPK); 
return 0; 

} 
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if(NULL — (ins WD - ::ivector( I, NPK ))) { 
status, = STS_NO_MEM; 
: :free_ivector(insSP, 1 ,NPK); 
return 0; 
5 } 

if(l != ::insMetric( final.bmid(), final.bwid(), ins WD, NPK )) { 
status_= STS_TOO_FEW; 
: :free_i vector(insSP, 1 ,NPK); 
: :free_i vector(insWD t 1 ,NPK); 
10 return 0; 
} 

if(NULL == (xbnd = xbndara_( final ))) { 
status_ = STS_NO_MEM; 
: .freei vector(insSP, 1 ,NPK); 
15 ::free_ivector(insWD,l,NPK); 
return 0; 

} 

if(NULL == (buzz = buzzaraj final ))) { 
status, = STS_NO_MEM; 
20 ::free_ivector(insSP,l,NPK); 

::free_ivector(insWD,l,NPK); 
: :free_vector(xbnd, 1 ,NPK); 
return 0; 
} 

25 BandStatArray& bsa = bandStats(); 
bsa.init( NPK ); 

for(int idxO = 0; idxO <NPK; idxO++) { 
int idxl = idxO+1, scanl = final.bmid( idxl ); 
float onsv, ofsv; 
30 bsa.ntnr( idxO, idxl ); 

bsa.posn( idxO, scanl ); 
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bsa.hght( idxO, float(nWvf_->enw( scanl ) )); 
onsv = float(nWvf_->envv(final.bbgn(idxl))); 
ofsv = float(nWvf_->enw(finalbend(idxl))); 
bsa.lowv( idxO, onsv<ofsv?onsv:ofsv ); 
bsa.xbnd( idxO, xbnd[ idxl ] ); 
bsa.buzz( idxO, buzz[ idxl ] ); 
bsa.insr( idxO, final.ins(idxl)); 
bsaxall( idxO, seq[ idxl ] ); 
if(pass2) { 

double cc = -1.0; 

int N = final.bwid(idxl ); 

if(N>4) { 

double *BX, *BY, miny = 1000.0, cclr, ccrl; 

float bandcoefpt]; 

int wdx, scanl = final.bbgn(idxl); 

BX = ::dvector(l,N); 

BY = ::dvector(l,N); 

for(wdx=l ;wdx<=N;wdx++) { 

BX[wdx] = double(wdx); 

BY[wdx] = nWvf_->envv( scanl++ ); 

if(BY[wdx] < miny) miny = BYfwdx]; 

} 

for(wdx==l ;wdx<=N;wdx++) 
BY[wdx] -= miny; 

if(l != ::dquadratic( BX, BY, N, bandcoef )) { 
rv = 0; 

status, = STS_TOO_FEW; 

goto bugout; 

} 

cclr = ::corrcoef( PATTCOEF, bandcoef, 4 ); 
::dfliplr( BY, N ); 
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if(l != ::dquadratic( BX, BY,N, bandcoef )) { 
rv = 0; 

status, = STS_TOO_FEW; 
goto bugout; 

5 } 

ccrl = ::corrcoef( PATTCOEF, bandcoef, 4 ); 
cc = (cclr>ccrl)? cclr: ccrl; 
if(isnan(cc)) cc = -1.0; 
::free_dvector(BX, 1 ,N); 
10 ::free_dvector(BY,l,N); 
} 

bsa.shap( idxO, float(cc) ); 
double width, iwidth; 
width = double( final. bwid(idxl ) ); 
1 5 iwidth = double( insWD[ idxl ] ); 

bsa.widt( idxO, float(width/iwidth - 1 .0) ); 
bsa.bbgn( idxO, fmal.bbgn(idxl) ); 
bsa.bend( idxO, fmal.bend(idxl) ); 
float igap,slope,intercept, lfgap,rtgap,biggap,smlgap; 
20 igap = float(insSP[idx 1 ]); 

if(0.0f=igap){ 
bsa.lgap( idxO, 0.5f); 
bsa.sgap( idxO, 0.5f ); 
rv = 0; 
25 goto bugout; 

} 

else { 

igap = float(insSP[idxl]); 
slope = igap/4.0f; 
30 intercept = igap/2.0f; 

lfgap = float( final.lgap(idxl) ); 
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rtgap = float( final.rgap(idxl) ); 
if(lfgap>rtgap) { biggap = Ifgap; smlgap = rtgap; } 
else { biggap = rtgap; smlgap = lfgap; } 
if(biggap < slope) biggap = intercept - biggap*slope; 
5 if(smlgap < slope) smlgap = intercept - smlgap*slope; 

#if !defined(_WIN32) 

bsa.lgap( idxO, fmod(biggap,igap)/igap ); 
bsa.sgap( idxO, fmod(smlgap,igap)/igap ); 

#else 

10 bsa.lgap( idxO, float(fmod(biggap,igap)/igap) ); 

bsa.sgap( idxO. float(fmod(smlgap,igap)/igap) ); 
#endif 
} 

} 

15 } 
bugout: 

::free_vector( buzz, l,NPK);buzz = NULL; 
::free_vector( xbnd, l,NPK);xbnd = NULL; 
::free_ivector( ins WD, 1,NPK ); ins WD = NULL; 
20 ::free_ivector( insSP, 1 ,NPK ); insSP = NULL; 
return rv; 

} 

void 

SegRead::fbwlut( int spacing, int& fbw ) const 
25 { 

if(spacing < SMLGAP) spacing = SMLGAP; 
else if(spacing > BIGGAP) spacing = BIGGAP; 
fbw = pmb_->fbwlut_[ spacing ]; 

} 

30 

/*************************************** ****************************** 
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* FILE: SegRead.hxx 
♦AUTHOR: Andy Marks 

* NOTE: To use this easily include seqrdr.hxx, which 

* includes all the headers this needs. 
5 * COPYRIGHT (c) 1996, University of Utah 

*/ 

#ifndef _SEGREAD_HXX_ 
#define _SEGREAD_HXX_ 
static int const STATIC_BUF_SZ = NPTS/2; 
10 class MB; 

class PKDET; 
class SegRead 

{ 

public: 

1 5 enum Status { STS_UNINITD, STSJNITD, STS_NO_MEM, STS_BUF2SMALL, 
STS_TOO_FEW}; 
SegRead( int iSl, int fluor ); 
~SegRead(); 

SegRead( SegRead const& rhs ); 
20 SegRead const& operator=( SegRead const& rhs ); 

void iSl( int v) {iSl_ = v;} 

void wvfm( Wvfm const& rhs ); 

int iSl() const { return iSl_; } 

Wvfm* wvfm() { return nWvf_; } 

25 Wvfm const* wvfm() const { return nWvf_; } 

ShftVect& nsv() { return nsv_; } 

ShftVect const& nsv() const { return nsv_; } 

BandStatArrayA bandStats() { return bsa_; } 

BandStatArray const& bandStats() const { return bsa_; } 
30 void fbwlut( int spacing, int& fbw ) const; 

int fBandSpace( int& medSP ) const; 
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int nrefine( char const* Inordr, int FBW, int have, int p2 ); 
void blindeconv( Wvfm& sr, int Fbw ); 

int xtranorm( Wvftn& bd ? Wvfm const& source, int nscanl, int pass2 ); 

int setBandStats(PKDET const& fmaUnt FBW,char const* seq,int pass2); 
5 int peakdet( int npts, PKDET& putative ) const; 

Status status() const { return status; } 

void debug() const; 
private: 

Status status_; 
10 int iSl_; 

Wvfm *nWvf_; 

ShftVect nsv_; 

BandStatArray bsa_; 

Band nbandj STATIC_BUF_SZ ]; 
1 5 char seq_ [ STATIC_BUF_SZ ]; 

void maxlanecode_( int n, int* bcode ) const; 

float* xbndara_( PKDET const& putative ) const; 

float* buzzara_( PKDET const& putative ) const; 

int centroid_(int bgn, int end) const; 
20 MB *pmb_; 

}; 

#endif 

25 * FILE: seqrdr.hxx 
♦AUTHOR: Andy Marks 
* COPYRIGHT (c) 1996, University of Utah 
*/ 

#ifhdef_SEQRDR_HXX_ 
30 #define _SEQRDR_HXX_ 
static int const NPTS = 2048; 
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^include <stdio.h> 

^include <math.h> 

#include <limits.h> 

^include <float.h> 
5 #include <stddef.h> 

^include <stdlib.h> 

^include <string.h> 

^include <nrc/nr.hxx> 

^include <basecall/Wvfm.hxx> 
1 0 ^include <basecall/Metrics.hxx> 

#include <basecall/RdrOut.hxx> 

^include <basecall/AboutBQ.hxx> 

#if defined(sun) 

# include <ieeefp.h> 
15 #endif 

int d_cmp( void const*, void const* ); 

double corrcoef( float const* vl, float const* v2, int n ); 

int insMetric( int const* px, int const* py, int* ytmp, int N ); 

#if defmed( cplusplus) 

20 extern "C" { 

#endif 

int omitokn( int const* i, float const* h, float const* I, 

int const* lg, int const* rg, 

float const* x, int n, float** o); 
25 int gapcheck( int const* ig,int const* iw,int const* g,int const* w, 

char const* seq, int n, float** o); 

#if defined( cplusplus) 

} 

#endif 
30 #endif 
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1**********************************^**********^^^ *********** * ******** 

* FILE: ShftVect.cxx 
♦AUTHOR: Andy Marks 

* Copyright (c) 1995,1996, University of Utah 
*/ 

#include <basecall/mb.hxx> 
static const int 
NCUBES = 6, 

M = (27*NCUBES) - (NCUBES- 1 ); 
class Criterion 

{ 

public: 

Criterion() : valJO.O), idxJO) {;} 
-CriterionO {;} 

Criterion const& operatoi=( Criterion const& rhs); 
void debug( int lvl = 0 ) const; 
double val_; 
int idx_; 

}; 

class ShftVects 

{ 

public: 
ShftVects( int leh ); 
ShftVects( ShftVect const& sv ); 
ShftVects( ShftVects const& rhs ); 
ShftVects const& operator=( ShftVects const& rhs ); 
~ShftVects(); 

ShftVect align( Wvfm const& t ); 
void debug( int lvl = 0 ) const; 
private: 

void evaluate( Wvfm const& t ); 
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void best( int nelem ); 
int terminate(); 
int lenj 
ShftVect* svm_; 
5 Criterion* crit_; 
int mni_, mxi_; 
int loops_; 

}; 

ShftVect 

10 mcalign( Wvfm const& trace, ShftVect const& sv ) 
{ 

ShftVects svm( sv ); 
return svm.align( trace ); 

} 

15 class TriVects 
{ 

public: 
TriVectsO; 

TriVect const& operator[](int idx) const { return tvmjidx]; } 
20 -TriVectsO; 

void debug() const; 
private: 
int len_; 
TriVect* tvm_; 

25 }; 

Criterion const& 

Criterion: :operator=( Criterion const& rhs) 
{ 

if(this != &rhs) { 
30 idx_ = rhs.idx_; 
val_ - rhs.valj 
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} 

return *this; 

} 

void 

5 Criterion::debug( int lvl ) const 

{ 

::printf("Criterion: val=%8.3f idx=%2d\n",val_,idx_); 

} 

static const TriVects tvm; 
10 TriVect::TriVect() 

{ 

t[0] = t[l] = t[2] = 0; 

} 

TriVect::TriVect( TriVect const& rhs ) 
15 { 

t[0]=rhs.t[0]; 

t[l] = rhs.t[l]; 
t[2] = rhs.t[2]; 

} 

20 TriVect 

TriVect: :operator+( TriVect const& rhs ) const 

{ 

TriVect s( rhs ); 
s.t[0]+=t[0]; 
25 s.t[l]+=t[l]; 
s.t[2]+=t[2]; 
returns; 

} 

void 

30 TriVect: :debug( int lvl ) const 
{ 
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::printf("TriVect: [%hd,%hd,%hd]\n",t[0],t[ 1 ].t[2]); 

} 

TriVects::TriVects() : len_(M), tvm_(NULL) 

{ 

5 static int const radii[] = { 4, 8, 12, 16, 20, 24 }; 
int idx = 1 ; 

tvm = new TriVect[ len_ ]; 
if(NULL = tvmj { 

::fprintf(stderr,'TriVects::TriVects() out of memory An"); 
10 exit(l); 

} 

tvm_[0].t[0] = tvm_[0].t[l] = tvm_[0].t[2] = 0; 
for(int s=0;s<NCUBES;s++) { 
intko[3]; 

15 ko[0] = -radii[s]; ko[l] = 0; ko[2] = radii[s]; 
for(int i0=0; i0<3; i0++) 
for(intil=0;il<3;il++) 
for(int i2=0; i2<3; i2++) 
if(l!=i0|| l!=il || l!=i2) { 
20 tvm_[idx].t[0] = ko[i0]; 

tvm_[idx].t[l] = ko[il]; 
tvm_[idx].t[2] = ko[i2]; 
idx++; 

} 

25 } 
} 

TriVects: :~TriVects() 
{ 

if(tvm J { delete (] tvm_; tvm_ = NULL; len_ = 0; } 

30 } 
void 
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TriVects::debug() const 
{ 

::printf("TriVects @ %p\n",(void*)this); 
::printf("\tlen_ = %ld\n",lenj; 
for(int idx = 0; idx < len_; idx++) 
t vm_[ idx] . debug(); 

} 

ShftVect::ShftVect() 
{ 

s_[0] = s_[l] = s_[2] = s_[3] = 0; 

} 

ShftVect::ShftVect(short vl, short v2. short v3, short v4) 
{ 

s_[0] = vl ; s_[l] = v2; s_[2] = v3; s_[3] = v4; 

} 

ShftVect::ShftVcct( TriVect const& tv ) 
{ 

intmin= 1000; 
int idx; 

s_[0] = tv.t[0]+tv.t[l]+tv.t[2]; 
s_[l] = rv.t[l]+tv.t[2]; 
s_[2] = tv.t[2J; 
s_[3J = 0; 

for(idx = 0; idx < 4; idx++) 

if(s_[idx]<min) min = s_[idx]; 
for(idx = 0; idx < 4; idx++) 

s_[idx] -= min; 

} 

TriVect 

ShftVect::trivec() const 
{ 
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TriVect t; 
short mv = -s_[3]; 
t.t[2] = s_[2]+mv; 
t.t[l] = s_[l]-t.t[2]+mv; 
5 t.t[0] = s_[0]-t.t[ 1 ]-t.t[2]+mv; 
return t; 

} 

short 

ShftVect::maxshft() const 
10 { 

short mx = s_[0]; 
for(int idx=l;idx<=3;idx++) 
if(s_[idx] > mx) 
mx = s_[idx]; 
15 return mx; 
} 

ShftVect 

ShftVect::ralt() const 

{ 

20 short a,g,c,t, mn; 

a - sJO] + short(5.0*drand48()) - 2; 

g = sjl ] + short(5.0*drand48()) - 2; 

c - s J2] + short(5.0*drand48()) - 2; 

t = s J3] + short(5.0*drand48()) - 2; 
25 mn = a; 

if(g<mn) mn = g; 

if(c<mn) mn = c; 

if(t<mn) mn = t; 

ShftVect sv(a-mn,g-mn.c-mn,t-mn); 
30 return sv; 

> 
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void 

ShftVect::debug( int lvl ) const 

{ 

::printf("ShftVect @ %p:",(void*)this); 
5 ::printf("\ts_: [%hd,%hd,%hd,%hd]\n",s_[0],s_[l],sJ2],sJ3]); 

} 

ShftVects::ShftVects( int len ) : 

lenjlen), svm_(NULL), crit_(NULL), mniJO), mxi_(len_-l), loopsJO) 

{ 

1 0 svm_ = new ShftVect[ len_ ]; 
crit_ = new Criterionf len_ J; 
if(!svm_ || IcritJ { 

::fprintf(stderr,"ShftVects::ShftVects(%ld) out of memory.\n",len_); 

exit(l); 
15 } 

for(int idx = 0; idx < len_; idx++) { 

crit_[idx].idx_ = idx; 

crit_[idx].val_ = 0.0; 

} 

20 } 

ShftVects::ShftVects( ShftVects const& rhs ) : 
lenJO), svmJNULL), crit_(NULL), mniJO), mxiJO), loopsJO) 

{ 

♦this = rhs; 

25 } 

ShftVects: :ShftVects( ShftVect const& sv ) : 
lenJM), svm_(NULL), critJNULL), mniJO), raxi_(M-l), loopsJO) 

{ 

TriVect t = sv.trivec(); 
30 svm_ = new ShftVect[ ]en_ ]; 
crit_ - new Criterion[ len_ ]; 
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if(!svm_ || !crit_) { 

::fprintf(stderr,"ShftVects::ShftVects(ShftVect&) out of memory.Vn"); 

exit(l); 

} 

5 for(int idx = 0; idx < len_; idx++) { 
ShftVect sv( tvm[ idx ] + 1 ); 
svm_[idx] = sv; 
crit_[idx].idx_ = idx; 
crit_[idx].val_ = 0.0; 
10 } 

} 

ShftVects::~ShftVects() 

{ 

if(svm_) { delete [] svm_; svm_ = NULL; } 
1 5 if(crit J { delete [] crit_; crit_ = NULL; } 
} 

ShftVects const& 

ShftVects::operator=( ShftVects const& rhs ) 
{ 

20 if(this != &rhs) { 

if(svm_) { delete [] svm_; svm_ = NULL; } 
if(crit_) { delete [] crit_; crit_ = NULL; } 
len_ = rhs.len_; 
svm_ = new ShftVectf len_ ]; 
25 crit_ = new Criterion[ len_ ]; 
if(!svm_ || IcritJ { 
::fprintf(stderr," ShftVects: :operator= out of memory An"); 
exit(l); 

} 

30 for(int idx=0; idx<len_; idx++) { 
svmjidx] = rhs.svm_[idx]; 
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critjidx] = rhsxritjidx]; 
} 

mni_ = rhs.mni_; 
mxi_ = rhs.mxi_; 
5 loops_ = rhs.loops_; 

} 

return *this; 

} 

void 

1 0 Shft Vects: :evaluate( Wvfm const& trace ) 

{ 

for(int idx-mni_;idx<=mxi_;idx-H-) { 
static int const BGNPT - 299, ENDPT = BGNPT+1399; 
critj idx],val_ = 0.0; 
1 5 crit_[ idx ] .idx_ = idx; 

trace.envelope( svm_[idx] ); 
if(Wvfm::MAX_INTEGRAL == trace.method()) 
for(int sdx=BGNPT; sdx<=ENDPT; sdx++) 
critj idx ].val_ += trace.enw( sdx ); 
20 else 

for(int sdx=BGNPT; sdx<=ENDPT; sdx++) 
crit_[ idx ].val_ += trace.xbnd( sdx ); 

} 

} 

25 void 

ShftVects::debug(int lvl) const 

{ 

::printf("ShftVects @ %p\n",(void*)this); 
::printf( tt \tlen_ = %ld\n",lenj; 
30 for(int idx = 0; idx < len_; idx++) { 
::printf( rt \t%ld '\idx); 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



194 

svm_[idx].debug(lvl); 
crit_[idx].debug(lvl); 

} 

::printf( M \tmxi = %ld mni = %ld loops_=%ld\n M ,mxi_,mni_,loopsJ; 

5 } 

static int 

c_cmp( void const* el , void const* e2 ) 

{ 

double vl = ((Criterion const*)el)->val_; 
1 0 double v2 = ((Criterion const* )e2)->val_; 
if(vl > v2) return 1; 
else if(vl < v2) return -1 ; 
else return 0; 

} 

15 void 

ShftVects::best( int topN ) 
{ 

ShftVects tmpSvs( topN ); 
tmpSvs Joops_ = Ioops_; 
20 ::qsort( (void*)crit_, len_, sizeof(*crit_J, c_cmp ); 
for(int tdx=0; tdx<topN; tdx++) { 
int sdx = len_ - topN + tdx; 
int fdx = critj sdx ].idx_; 
tmpSvs.svm_[tdx] = svm_[ fdx ]; 
25 tmpSvs.crit_[tdx] = crit_[ sdx ]; 
} 

*this = tmpSvs; 

} 

int 

30 ShftVects: :terminate() 
{ 
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static int const MAXLOOPS = 100; 
double ratio; 
mni_ = mxi_ = 0; 
for(int idx=l; idx<len_; idx-H-) { 
5 double v = crit_[idx].val_; 

if(v >= critjmxi J.val_) { mxi_ = idx; } 

if(v <= critjmnij.valj { mni_ = idx; } 

} 

svm_[mni J = svm_[ mxi_ ].ralt(); 
1 0 ratio = crit Jmni J . vaWcritJmxi J . val_; 

return int(((ratio > 0.97) && (loops_ >= 50)) || (loops_ > MAXLOOPS)); 

} 

ShftVect 

ShftVects::align( Wvfm const& trace ) 
15 { 

while(l) { 
evaluate( trace ); 
if(l = ++loopsJ 
best( 25 ); 
20 if(terminate()) 
break; 
else 
mxi_ = mni_; 

25 return svm_[mxi_J; 
} 

#ifdefmed(SA) 
int 

30 main(int argc, char const* argv[]) 
{ 
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ShftVect sv, rsv; 

Wvfm wvfm( Wvfm: : MIN_XB ANDING ); 
wvfm.debugO; 
rsv = mcalign( wvfm, sv ); 
5 : :printfi["Thc best alignemt is with "); 
rsv.debug(); 
return 0; 

} 

#endif 

10 

^**«« + t*4i«<»'**4i**4>>t>#*<*********'«>4i*«4>4>4>******«^*** **************** ******** 

* FILE: ShftVect.hxx 
♦AUTHOR: Andy Marks 

* Copyright (c) 1995,1996 University of Utah 
15 */ 

#ifhdef_SHFTVECT_HXX_ 
#defme _SHFTVECT_HXX_ 
struct TriVect 
{ 

20 TriVect(); 

TriVect operator+( TriVect const& rhs ) const; 
TriVect( TriVect const& rhs ); 
~TriVect(){}; 

void debug( int lvl = 0 ) const; 
25 short t[3]; 

}; 

#ifdefined(WIN32) 

class _declspec( dllexport) ShftVect 

#else 

30 class ShftVect 
#endif 
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{ 

public: 
ShftVectO; 

ShftVect( short vl , short v2, short v3, short v4 ); 
5 ShftVect( TriVect const& tv ); 
TriVect trivec() const; 
ShftVect ralt() const; 
void debug( int Ivl = 0 ) const; 
short s( int idx ) const { return s_[idx-l]; } 
1 0 short maxshft() const; 
private: 
short s_[4]; 

}; 

class Wvfin; 

1 5 ShftVect mcalign( Wvfm const& trace, ShftVect const& sv ); 
#endif 

/***** ************************** ************ ***************************** 

* FILE: spline.cxx 
20 * TYPIST: Andy Marks 

* Human Genetics Dept 

* UnivofUtah 

* DATE: Fri Mar 15 12:34:18 MST 1996 
V 

25 ^include <stdio.h> 
#include <nrc/nr,hxx> 
#if!defmed(SA) 
void 

spline( float const x[], float const y[], int n, float ypl, float ypn, float y2[] ) 
30 { 

int i, k; 
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float qn,un,*u; 
u = vector(l,n-l); 
if(ypl >0.99e30) 

y2[l]=u[l]=0.0f; 
5 else { 

y2[l]=-0.5f; 

u[l] = (3.0f/(x[2]-x[l]» * ((y[2]-y[l])/(x[2]-x[l]) - ypl); 

} 

for(i=2;i<=n-l;i-H-) { 
10 float p, sig; 

sig = (x[i]-x[i-l])/(x[i+l]-x[i-l]); 
p = sig*y2[i-l] + 2.0f; 
y2[i]=(sig-1.0f)/p; 

u[i] = (y[i+l]-y[i]) / (x[i+l ]-x[i]) - (y[i]-y[i-l]) / (x[i]-x[i-l]); 
15 u[i] = (6.0P u[i]/(x[i+l]-x[i-l]) - sig*u[i-l])/p; 

} 

if(ypn > 0.99e30) 

qn = un = O.Of; 
else { 
20 qn = 0.5f; 

un = (3.0f/(x[n]-x[n-l]))*(ypn-(y[n]-y[ti-l])/(x[n]-x[n-l])); 

} 

y2[n] = (un-qn*u[n-l])/(qn*y2[n-l]+1.0f); 
for(k=n-l;k>=l;k~) 
25 y2[k] = y2[k]*y2[k+l] + u[k]; 
free_vector(u, 1 ,n- 1 ); 

} 

void 

splint(float const xa[], float const ya[], float const y2a[], int n, float x, float *y) 
30 { 

static int klo.khi; 
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int k; 

float h,b,a; 

if(0=klo || 0=khi || x<xa[klo] || x>xa[khi]) { 
klo = 1 ; 
5 khi = n; 

while((khi-klo)> 1) { 
k = (khi+klo)»l; 
if(xa[k]>x) khi = k; 
else klo = k; 
10 } 
} 

h = xa[khi] - xafklo]; 

if(0.0f = h) nrerror("Bad xa input to routine splint: the xa's must be distinct"); 
a = (xa[khi]-x)/h; 
15 b = (x-xa[klol)/h; 

*y - a*ya[klo] + b*ya[khi] + ((a*a*a-a)*y2a[klo] + (b*b*b-b)*y2a[khi]) * (h*h)/6.0f; 

} 

#endif 

#ifdefined(SA) 
20 void 
main() 
{ 

float* x = vector( 1 ,4); 

float* y = vector(l,4); 
25 float* y2 = vector( 1 ,4); 

x[l ] = 1 .Of; x[2] = 2.0f; x[3] = 3.0f; x[4] = 4.0f; 

y[l ] = 1 .Of; y[2] = 2.0f; y[3] = 1 .Of; y[4] = 2.0f; 

::spline( x, y, 4, 1 .Of, 1 .Of, y2 ); 

for(float xt = 1 .Of; xt < 4.0f; xt += 1 .Of/3 .Of) { 
30 float yout; 

::splint( x, y, y2, 4. xt, &yout ); 
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::printf("xt=%fyout=%f\n M ,xt,yout); 
} 

} 

#endif 



* FILE: swxxx 
♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
10 +/ 

#include <stdio.h> 

#include <limits.h> 

^include <string,h> 

#include <basecall/sw.hxx> 
15 #include <nrc/nrutil .hxx> 

SW::SW( char const* vseq, char const* hseq ) : 
vszJO), hszJO), outszJO), vseq_(NULL), hseqJNULL), 
score_(INT_MIN), vseqoutJNULL), hseqout JNULL), 
scoresJNULL), path JNULL), vcoordJO), hcoord_(0), 
20 vposOJO), hposOJO) 

{ 

if(!vseq || !hseq) return; 
vsz _= ::strlen( vseq ); 
hsz_= ::strlen( hseq); 
25 vseq_ = new char[ 1 + vsz_ + 1 ]; 
hseq_ ~ new char[ I + hsz_+ I ]; 
if(!vseq_ || !hseq_) { 
if(vseq_J delete [] vseq_; 
if(hseq_J) delete [] vseq_; 
30 vseq_ = hseq_ = NULL; 

return; 
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} 

::strcpy( &vseq_[I], vseq ); 

::strcpy( &hseq_[l], hseq ); 

int MVSZ=vsz_+l, MHSZ=hsz_+l; 

scores_ = ::imatrix( 1 ,MVSZ, 1 ,MHSZ ); 

path_ = ::imatrix( l,vsz_, l,hsz_ ); 

if(!scores_ || IpathJ) { 

delete [] vseq_; vseq_ - NULL; 

delete [] hseq_; hseq_ = NULL; 

if(scoresJ ::freejmatrix(scores_,UMVSZ,l,MHSZ); scores_ = NULL; 
if(path J ::freejmatrix(path_, 1 ,vsz_, 1 ,hsz_ ); path_ = NULL; 
return; 

} 

swj); 

for(int hdx=2;hdx<=MHSZ;hdx-r+) 
if(scores_[MVSZ][hdx] > scorej { 
vcoord_ = vsz_; 
hcoord_ = hdx-l; 
score_ = scores_[MVSZ][hdx]; 
} 

for(int vdx=2;vdx<=MVSZ;vdx++) 
if(scores_[vdx][MHSZ] > scorej { 
vcoord_ = vdx- 1 ; 
hcoord_ = hsz_; 
score_ = scores_[vdx][MHSZ]; 
} 

swwalk_(); 

} 

SW::SW(SWconst&rhs): 

vszJO). hszJO), outszJO), vseq_(NULL), hseqJNULL), 
scoreJINT_MIN), vseqout_(NULL), hseqout_(NULL), 
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scores JNULL), pathJNULL), vcoordJO), hcoord JO), 
vposOJO), hposOJO) 

{ 

*this = rhs; 

5 } 

SW const& 

SW::operator=( SW const& rhs ) 

{ 

if(this != &rhs) { 
10 if(vseqj { delete [] vseq_; vsecL = NULL; } 

if(hseqj { delete [] hseq_; hseq_ = NULL; } 

if(vseqout J { delete [] vseqout_; vseqout_ = NULL; } 

if(hseqout J { delete [] hseqout_; hseqout_ = NULL; } 

if(scoresJ { ::free_imatrix(scores_,l,vsz„+l,l,hsz_+l); scores_=NULL; } 
15 if(pathj { ::free_imatrix(path^l,vsz_,l,hszj; path_ =NULL; } 

vsz_ = rhs.vsz_; 

hsz_ = rhs.hsz; 

outsz_ = rhs.outsz_; 

score_ = rhs.score_; 
20 vpos0_ = rhs.vpos0_; 

hpos0_ = rhs.hposOj 

vseq_ = new char[ vsz_ + 2 ]; 

if(vseq_J ::memcpy( vseq_, rhs.vseq_, vsz_+2 ); 

hseq_ = new char[ hsz_ + 2 ]; 
25 if(hseq__) :;memcpy( hseq_, rhs.hseq_, hsz_+2 ); 

vseqout_ = new char[ outsz_ ]; 

if(vseqout_) ::memcpy( vseqout_, rhs.vseqout_, outsz_ ); 
hseqout_ = new char[ outsz_ ]; 

if(hseqout_J ::memcpy( hseqout_, rhs.hseqout_, outsz_ ); 
30 scores^ = : :imatrix( 1 t vsz_+ 1 , 1 ,hsz_+ 1 ); 
path_ = : :imatrix( 1 ,vsz_, 1 .hsz_); 
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if(scores_ && pathj { 
for(int vdx= 1 ;vdx<=vsz_+ 1 ;vdx++) 
for(int hdx=l;hdx<=hsz_+l;hdx++) { 
scores_[vdx][hdx] = rhs.scores_[vdx][hdx]; 
if(vdx<=vsz_ && hdx<=hsz_) 
path_[vdx][hdx] = rhs.path_[vdx][hdx]; 

} 

} 

} 

return *this; 

} 

SW::~SW() 
{ 

if(vseqj {delete [] vseqj vseq_ = NULL; } 

if(hseq_) {delete [] hseq_; hseq_ = NULL; } 

if(vseqout_) (delete [] vseqout_; vseqout_ = NULL; } 

if(hseqout_) {delete [] hseqout_; hseqout_ = NULL; } 

if(scores_) { : :free_imatrix( scores, 1 ,vsz_+ 1 , 1 ,hsz_+ 1 );scores_=NULL; } 

if(path_) {::free_imatrix( path_,l,vsz_,l,hsz_ );path_ = NULL;} 

} 

void 

SW::sw_() const 
{ 

static int const MATCHVAL = 1 , MISMATCH VAL = - 1 , 

VGAP_VAL = -3, HGAPJVAL = -3; 
int vdx, hdx; 

for(vdx=l ;vdx<=vsz_+l ;vdx++) 
for(hdx=l;hdx<=hsz_+l;hdx++) { 
scores_[vdx][hdx] = 0; 
if(vdx<=vsz_ && hdx<=hszj 
path Jvdx][hdx] = 0; 
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} 

for(int i=l;i<=vsz_;i-H-) 
for(int j=l;j<= ! hsz_;j-H-) { 

int val[4], I, m = (vseq_[i]=hseq_(j])? MATCHVAL: MISMATCHVAL; 
5 val[l] = scores_[i][j+l] + VGAP_VAL; 

val[2] = scores_[i][j] + m; 
val[3] = scores_[i+l][j] + HGAP_VAL; 
int mx = val[I=l]; 
for(int k=2;k<=3;k++) 
10 if(val[k]>mx) 
mx = val[I=k]; 
scores_[i+l][j + l ] = mx; 
path_[i][j] = l-2; 
} 

15 } 
void 

SW::swwalk_() 
{ 

static char const GAPCHAR = 
20 int idx. N, vpos, hpos ? pos; 

char *v, *h; 

N = vcoord_+hcoord_; 

v = new char[ 1+N+l ]; if(!v) return; 

h = new char[ 1+N+l ]; if(!h) { delete [] v; return; } 
25 pos = N; 

vpos = vcoord_; 

hpos = hcoord_; 

for(idx=0;idx<=(N+l);idx++) 
v[idx] = h[idx] = 0; 
30 while(vpos>0 && hpos>0) { 
switch( path_[vpos][hpos] ) { 
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case - 1 : v[pos] = vseqj vpos- ]; h[pos] = GAPCHAR; break; 
case 0: v[pos] = vseqj vpos- ]; h[pos] = hseqj hpos-- ]; break; 
case 1 : v[pos] = GAPCHAR; h[pos] = hseqj hpos- ]; break; 
default: 

5 ::fprintf(stderT/'SW::swwalkJ) encountered %d, not in [-l,..,l]\n M , 

path_[vpos][hpos]); 
delete [] v; 
delete [] h; 
return; 

10 } 

pos--; 

} 

pos++; 

vpos0_ = vpos; 
1 5 hpos0_ = hpos; 

if(vseqout J { delete [] vseqoutj vseqout_ = NULL; } 

if(hseqout_) { delete [] hseqout ; hseqout_ = NULL; } 

outsz_ = N-pos+1+1 ; 

vseqout_ = new char[ outsz_ ]; 
20 hseqout_ = new char[ outsz_ ]; 

if(vseqout_) ::strcpy( vseqout_, &v[pos] ); 

if(hseqout_) ::strcpy( hseqout_, &h[pos] ); 

delete [] v; 

delete [] h; 

25 } 
void 

SW:;debug() const 
{ 

::printf("SW @ %p\n'\(void*)this); 
30 ::printfC vsz_ = %2d hsz_=%2d outsz_=%2d\n'\vsz_ ? hsz_,outsz_); 
::printf(" vseq_ = [%s]\n ,, ? vsecL?&vseq_[l]: ,, nuir); 
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::printf(" hseq_= [%s]\n",hscq_?&hseq_[l]: M nuir'); 
::printf(" vpos0_ = %2d hpos0_ = %2d\n",vposO_,hposO _); 
: :printf(" score_=%2d\n",score J; 

::printf(" vcoord_ = %4d hcoord_ = %4d\n",vcoord_,hcoord_); 
::printf( M vseqout_ = [%s]\n'\vseqout_?vseqout_:"nuH"); 
::printf(" hseqout_ = [%s]\n".hseqout_?hseqout_:"null"); 
::printf("scores_:\n"); 
if(scores_) 
for(int vdx=l;vdx<=vsz_+l;vdx++) { 
::printf(" "); 

for(int hdx=l ;hdx<=hsz_+l ;hdx++) 
::printf("%2d ".scores_[vdx][hdx]); 
-printfCV); 
} 

::printf("path_:\n"); 
if(pathj 

for(im vdx=l;vdx<=vsz_;vdx++) { 
::printf(" "); 

for(int hdx=l ;hdx<=hsz_;hdx++) 

::printf("%2d ",path_[vdx]fhdx]); 
::printf("\n"); 
} 

} 

#ifdefmed(SA) 
int 

main() 
{ 

char const* transposon = "tccattggccctcaaacccc"; 
char const* observed = "tccattggccctccaaacccc" ; 
SW sw( transposon, observed ); 
sw.debug(); 
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return 0; 

} 

#endif 

5 f**************************** ****************** ********************* 

* FILE: sw.hxx 
♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

1 0 #if !defined(_SW_HXXJ 

#define_SW_HXX_ 

#ifdefmed(WIN32) 

class _declspcc( dllexport ) SW 

#else 
15 class SW 

#endif 

{ 

public: 

S W( char const* known, char const* unknown ); 
20 SW( SW const& rhs ); 

SW const& operator=(SW const& rhs); 
~SW(); 

int score() const { return score_; } 

int vcoord() const { return vcoord_; } 
25 int hcoord() const { return hcoord_; } 

int vpos0() const { return vpos0_; } 

int hpos0() const { return hpos0_; } 

char const* vout() const { return vseqout_; } 

char const* hout() const { return hseqout_; } 
30 void debug() const; 
private: 
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int vsz_, 

hsz_. 

outsz_, 

vposO_, 
5 hposO_; 
char *vseq_ t 

*hseq_; 
int score_; 
char *vseqout_, 
10 *hseqout_; 
int **scores_, 

**path_; 
int vcoord_, 

hcoord_; 
1 5 void sw_() const; 
void swwalk_(); 

}; 

#endif 



2Q y+*+%***** + ++4****+*+****+****+#+****++***+**^+**+**+****** 

* FILE: Wvfm.cxx 
♦AUTHOR: Andy Marks 

♦ COPYRIGHT (c) 1996, University of Utah 
*/ 

25 #include <basecall/mb.hxx> 

#include <nrc/nr.hxx> 

#include <nrc/Complex.hxx> 

#inciude <time.h> 

#include <basecalI/RatioBin.hxx> 
30 Wvfin::Wvfm() : 

rowsJO). colsJO), bgni_(0), endi_(0), dsJUNKNOWN), pm_(NULL), 
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pv_(NULL), piJNULL), px_(NULL), pbJNULL), methodJMAXJNTEGRAL), 
status_(STS_UNINITD), ib_(_2X), fCalcSSTJl ), obgniJO), oendiJO) 

{ 

InordrJO] = W; 
5 for(int idx=0;idx<4;idx++) 
for(int jdx=0y dx<4;jdx++) 
ssm_[idx]Ddx] = (idx=jdx)? 1.0: 0.0; 
status, = STS_NO_SIZE; 

} 

1 0 Wvfm: : Wvfm( char const* dataf, char const* parmf. char const* lnordr, DATASRC dsrc, 
Method m ) : 

rowsJO), colsJO), bgniJO), endiJO), dsjdsrc), pm_(NULL), 
pv_(NULL), piJNULL), px_(NULL), pb_(NULL), methodjm), 
status_(STS_UNINITD), ib_(_2X), fCalcSSTJl), obgniJO), oendiJO) 

int BBUFSZ; 

int** bbuf, rdx, cdx, scanl, idx, jdx; 
char scratch[ 128 ]; 
FILE* fp = NULL; 
20 for(idx=0;idx<4;idx++) 
for0dx=0ydx<4;jdx++) 
ssmjidx][jdx] = (idx=jdx)? 1.0: 0.0; 
if[!parmf) { 
if(Jnordr) { 
25 ::strcpy(lnordr_Jnordr); 
lnordr_[4] = '-'; 
Inordr_[5] = \0'; 
} 

else { 

30 status_ = STS_NO_LNORD; 

return; 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



} 

} 

else if(NULL=(fp=::fopen(parmr"r n ))) { 
status, = STS_NO_PARMF; 
5 return; 
} 

else { 
if(NULL != lnordr) 
: :strcpy(lnordr_.lnordr); 
10 else { 

::fscanf( fp, "%s\n", scratch ); 
::fscanf( fp, "%s\n", lnordr J; 
::fscanf( fp, "%s\n", scratch ); 
} 

15 lnordr_[4] = 

lnordr_[5] = '\0*; 
for(idx = 0; idx < 4; idx++) 
if(4 != ::fscanf(fp,"%lf %lf %lf %lf\ 

&ssm_[idx][0], &ssm_[idx][l], &ssm_[idx][2], &ssm_[idx][3] )) { 
20 ::fprintf(stderr,"SSM file[%s] has [%s] instead of 4 doublesAn", 

parmf,scratch); 

::fprintf(stderr,"Wrong file, or incorrect file format.\n"); 

exit(l); 

} 

25 ::fclose(fp); 

fCalcSST_=0; 

} 

if(!dataf || (NULL = (fp = ::fopen(dataf,"r")))) { 
status_ = STS_NO_DATAF; 
30 return; 

} 
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#undef OLD_FMT 
#ifdefined(OLD_FMT) 

BBUFSZ= 12000; 

while(l) { 

5 if(NULL == ::fgets( scratch, sizeof(scratch), fp)) 
break; 

else if(0 = ::strncmp(scratch,"INTENSITY_DATA",14)) 
break; 

} 

10 #else 

char str[2][64]; 

int nfluor; 

float fs; 

int dummy; 
15 intNF; 

NF = ::fscanf(fp,"%s %s %d %d %f 
%d\n",strtO],str[ 1 ],&nfluor,&BBUFSZ,&fs,&durnmy); 

if(4 != nfluor) { 

::fprintf(stderr,"%s:%d, $3=%d, not 4\n M ,_FILE_,_LINE_.nfluor); 
20 status_ = STS_IFILE_UN; 
return; 
} 

if(1.75f != fs && 2.00f != fs) { 
::fprintf(stderr,"%s:%d, $4=%4.2f, neither 1.75 nor2.00\n",_FILE_,_LINE_,fs); 
25 status, = STS_IFILE_UN; 
return; 

} 

#endif 

if(NULL = (bbuf = ::imatrix( 1,BBUFSZ, 1,4 ))) { 
30 status, = STS_NO_MEM; 
return; 
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} 

for(scani=1; scan!<=BBUFSZ; scanl++) 
if(NULL = ::fgets( scratch, sizeof(scratch), fp )) 
break; 

5 else if(0 = ::strncmp(scratch,"INTENSITY_DATA_END:", 1 9)) 

break; 
else { 
float b[4]; 

int* p = bbuf[scanl]; 

10 if(4 != ::sscanf( scratch, "%f %f %f 0 /of',&b[0],&b[l],&b[2],&b[3])) { 

::free_imatrix( bbuf, 1,BBUFSZ,1,4 ); 
status_= STS_SCNL_RD_FAIL; 
return; 
} 

15 else { 

p[l] = int(b[0]+0.5f); 

p[2] = int( b[l]+0.5f); 

p[3] = int( b[2]+0.5f); 

p[4] = int(b[3]+0.5f); 
20 } 
} 

::fclose(fp); 
scanl-; 

if(NULL=(pm_=::dmatrix(l,rows_=scanl,l,cols_=4))) { 
25 ::free_imatrix( bbuf, UBBUFSZ, 1,4 ); 
status_ = STS_NO_MEM; 
return: 
} 

else for(rdx=l;rdx<=rows_;rdx-H-) 
30 for(cdx=l;cdx<=cols_;cdx++) 

pm_[rdx][cdx] = double( bbuf[rdx][cdx] ); 
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::free_imatrix(bbuf, 1,BBUFSZ, 1,4 ); 
status_ = STS_INITD; 

} 

Wvfm::Wvfm( int r, int c, char const* Inordr, DATASRC ds< Method m ) : 
5 rows_(r), colsjc), bgnijl), endiJO). dsjds), pm_(NULL), 

pv_(NULL), pi_(NULL), px_(NULL), pb_(NULL), method_(m), 
status_(STS_UNINITD), ibJ_2X), fCalcSSTJl), obgnijl), oendiJO) 

{ 

int idx.jdx, cdx, rdx; 
10 if(0 == endij endi_ = scanl(); 
lnordr_[0] = W; 
if(lnordr) 

::strcpy( lnordr_, Inordr ); 
pm_= ::dmatrix( l,rows_, l,cols_); 
1 5 for(cdx= 1 ; cdx<=cols_; cdx++) 
for(rdx=l ; rdx<=rows_; rdx++) 
pm_[rdx][cdx] = 0.0; 
for(idx=0;idx<4;idx++) 
for0dx=0;jdx<4;jdx++) 
20 ssm_(idx][jdx] = (idx=jdx)? 1 .0: 0.0; 

status_=STS_INITD; 

} 

Wvfm::Wvfin( Wvfin const& rhs ) : 
rows JO), colsJO), bgniJO), endiJO), pm_(NULL), 
25 pv JNULL), pi_(NULL), px_(NULL), pb JNULL), method_(MAX_INTEGRAL), 
ib_(_2X), fCalcSSTJl), obgniJO), oendiJO) 

{ 

♦this = rhs; 

} 

30 Wvfm const& 

Wvfm::operator=( Wvfm const& rhs) 
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{ 

if(this!=&rhs){ 
int NS, idx, jdx; 
release(); 
5 status^ = rhs.statusj 
rows_ = rhs.rows_; 
cols_ =rhs.cols_; 
obgni_ = rhs.obgni_; 
oendi_ = rhs.oendij 
10 bgni_ = rhs.bgni_; 
endi_ ^rhs.endij 
ds_ = rhs.ds_; 
method^ = rhs.method_; 
ib_ = rhs.ib_; 
1 5 fCalcSST_ = rhs.fCalcSST_; 

if(rhs.pm_ && 0!=rows_ && 0!=cols J 
if(NULL — (pm_ = ::dmatrix(hrows_,l,cols_))) { 
status^ = STS_NO_MEM; 
goto bugout; 
20 } 

else for(idx=l ;idx<=rows_;idx++) 
for(int jdx= 1 y dx<=cols_;jdx++) 
pm_[idx][jdx] = rhs.pm_[idx][jdx]; 
::strcpy(lnordr_,rhs.lnordr_); 
25 for(idx==0;idx<4;idx-H-) 
for(jdx=0;j dx<4;jdx++) 
ssm_[idx][jdx] = rhs.ssm_[idx](jdx]; 
NS=scanl(); 

if(rhs.pv_ && rhs.pi_ && rhs.px_ && ths.pbj { 
30 pv_= ;:dvector( 1,NS); 

px_= ;:dvector( UNS); 
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pi_ = ::ivector( 1, NS ); 
pb_ = ::vector( 1,NS); 
if(!pv_|| !pi_|| !px_|| !pbj{ 
status, = STS_NO_MEM; 
5 goto bugout; 

} 

for(idx=l;idx<=NS;idx-H-) { 
pv_[idx] = rhs.pv_[idx]; 
pi_[idx] = rhs.pi_[idx]; 
10 px_[idx] = rhs.px_[idx]; 

pb_[idx] = rhs.pb_[idx]; 
} 

} 

} 

1 5 bugout: 

return *this; 

} 

void 

Wvfm::release() 
20 { 

int NS = scanl(); 

if^NULL != pm_) { ::free_dmatrix( pm_,l,rows_,l,cols_); pm_ = NULL; } 
ifi(NULL != pvj { ::free_dvector( pv_, 1, NS ); pv_ = NULL; } 
if(NULL != pi J { ::free_ivector( pi_, 1, NS ); pi_ = NULL; } 
25 if(NULL != pxj { ::free_dvector( px_, 1 , NS ); px_ = NULL; } 
if(NULL != pbj { ::free_vector( pb_, 1. NS ); pb_ = NULL; } 

} 

Wvfm::~Wvtm() 
{ 

30 release(); 
} 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCI7US97/16933 



216 

struct VI { double v; int i,z ; } ; 
static int 

vi_cmp(void const* pi, void const* p2) 

{ 

5 double v 1 = ((VI const*)p 1 )->v; 
double v2 - ((VI const*)p2)->v; 
if(vl>v2) return 1; 
else if(v 1 <v2) return - 1 ; 
else return 0; 

10 } 
void 

Wvfm:;bgnEnd() 
{ 

if(bgni_>=2 && endi_>bgni_ && endi_<scanl()) { 
15 obgni_ = bgni_; 
oendi_ = endi_; 
return; 

} 

static int const NZ = 8; 
20 VIvi[l+NZ]; 

int ZSZ = scanl()/NZ; 
for(int nz=l;nz<=NZ;nz++) { 
int zb = l+(nz-l)*ZSZ, ze = nz*ZSZ, mi; 
double mv = sc_la(mi=zb,l); 
25 for(int s=zb+l;s<=ze;s-H-) { 
for(int f=2;f<=lanes();f++) { 
double v = sc_la(s,f); 
if(v>mv) { mv=v; mi=s; } 
} 

30 } 

vi[nz].v = mv; 
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vt[nz].i = mi; 
vi[nz].z= nz; 

} 

: :qsort(&vi[l ],NZ ? sizeof(VI),vLcmp); 
5 intmxl = vi[NZ].z; 
intmx2 = vi[NZ-l].z; 
double primerPeak; 
if(l=abs(mxl-mx2)) { 
bgni_ = vi[NZ].i; 
1 0 primerPeak = vi[NZ].v; 
endi_ = scanl()-100; 

} 

else if(mxl<mx2) { 
bgni_ = vi[NZ].i; 
1 5 primerPeak = vifNZJ.v; 
endi_ = vi[NZ-l].t-100; 
} 

else { 
bgni_ = vi[NZ-l].i; 
20 primerPeak = vi[NZ-l].v; 
endi_ = vi[NZ].i-100; 

} 

int B=bgni_; 

for(int ns=bgni_+3 0;ns<=bgni_+400;ns++) 
25 for(int f=l ;f<=lanes();f++) 

if(scja(ns,f) > 0.3* primerPeak) { 
if(bgni_=B) B-ns; 
continue; 
} 

30 double mu=0.0, nv=0.0; 

int m } f. LHS = (bgni_+endi J/2; 
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for(m=bgni_;m<=LHS;nH-+) { 
double lmx = sc_la(m,l); 
for(f=2;f<=lanes();f++) 
if(sc_la(m,f) > lmx) 
5 lmx = sc_la(m,f); 

mu += lmx; 
nv += 1.0; 
} 

mu /= nv; 
10 nv = 0.0; 

for(m=bgni_;m<=LHS;m-H-) { 

double lmx = sc_la(m,l); 
for(f=2;f<=lanes();f++) 

if(sc_la(m,f) > lmx) 
15 lmx = sc_la(m,f); 

if(0.0=nv && lmx<=mu) 

nv++; 

else if(0.0!=nv && lmx>mu) { 
bgni_ = m; 
20 break; 
} 

} 

#ifl 

endi_ -= 350; 
25 #else 

double denom, numer, nb; 

numer = double(endi_-bgni_+l); 

if(_lX==ibJ denom = 10.0; 

else if(_2X=ibJ denom = 7.0; 
30 else denom = 5.0; 

nb = numer/denom; 
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if(nb>=650.0) endi_ -= int((nb-650.0)*denom); 
#endif 

obgni_ = bgni_; 
oendi_ = endi_; 

5 } 

static void 

fsmPkdet(double const* vec,short b,short e,short& Pl,short& PRshort* pk,short* tr) 
{ 

enum STATE { STJJK, ST_UP, STDN } st = STJJK; 
10 short scnl,TN; 
double vm 1 ; 
TN = PN = 0; 
vml = vec[ scnl=b ]; 
for(-H-scnl; scnl<=e; scnl++) { 
1 5 double v = vec[scnl]; 
switch(st) { 
case STJJK: 
if(v>vml) st=ST_UP; 
else if(v < vml ) st=ST_DN; 
20 break; 

case ST_UP: 
if(v < vml ) { st = ST_DN; pk[ ++PN ] = scnl-1 ; } 
break; 

case ST DN: 

25 if(v > vml) { st = ST_UP; tr[ ++TN ] = scnl-1 ; } 

break; 

} 

vml = v; 
} 

30 Pl=l; 

if(pk[Pl]<tr[l])Pl++; 
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if(pk[PN] > tr[TN]) PN--; 

} 

static void 

bslAdjust( double* bsl, int bgn, int end ) 
5 { 

static int const LW =125; 
intN, FFTSZ; 

double* minv, *hn, *lpfbsl, *rawmin=&bsl[bgn-l]; 
# define REAL(n)(2*(n)-l) 
10 # define IMAG(n)(2*(n)) 
int c; 

N = end-bgn+1; 

FFTSZ = 1 «int(::ceil(::log(3*N+(2*LW-l)-l)/::log(2.0))); 
minv = ::dvector(l,2*FFTSZ); 
15 hn =::dvector( 1,2* FFTSZ); 
lpfbsl = ::dvector(l,N); 
for(c= 1 ;c<=2*FFTSZ;c++) 
minv[c] = hn[c] = 0.0; 
for(c=l;c<=N;c++) { 
20 double v = rawmin[c]; 

minv[REAL(N+ 1 -c)] =v; 
minv[REAL(N+c)] = v; 
minv[REAL(3*N+l-c)] = v; 

} 

25 ::dfourl( minv, FFTSZ, 1 ); 
for(c=l;c<=LW;c++){ 
double v = 0.5*(1.0 + ::cos(double(c-l)*NR_PI/double(LW)))/double(LW); 
hn[REAL(c)] » v; 

if(l !=c) hn[REAL(FFTSZ-c+2)] = v; 
30 } 

::dfourl( hn, FFTSZ, 1 ); 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT/US97/16933 



221 

::dCMul( minv, hn, hn, FFTSZ ); 
::dfourl(hn, FFTSZ, -1 ); 
for(c=l;c<=N;c++) 

lpfbsl[c] = hn[REAL(N+c)]/double(FFTSZ); 
5 double BX=0.0, BY=0.0, BN=0.0, EX=0.0, EY=0.O, EN=0.0; 
double si, in; 
for(c=l;c<=N/10;c++) { 
double bdel=lpfbsl[c]-rawmin[c], 

ede!=lpfbsl[N+ 1 -c]-rawmin[N+l-c]; 
10 if(bdel>0.0) { BN++; BX+=double(c); BY+=bdel; } 
if(edel>0.0) { EN++; EX+=double(N+l-c); EY+=edeI; } 
} 

BX /= BN; BY /= BN; 
EX/=EN; EY /= EN; 
15 si = (EY-BY)/(EX-BX); 
in = EY - sl*EX; 
for(c=l;c<=N;c-H-) 

rawmin[c] = lpfbsl[c] - (double(c)*sI+in); 
::free_dvector(minv, 1 ,2*FFTSZ); 
20 ::free_dvector(hn, 1,2*FFTSZ); 
::free_dvector(lpfbsl, 1,N); 

} 

void 

Wvfin::fbbls_(int fnr, double* bsl, DIRECTION, DIR) const 
25 { 

#undef DOCPATENT 
#ifdefined(DOC_PATENT) 
static int sl=l; 
#endif 

30 double TAU=75.0; 

double prvmin, newmin, K, cnt=l .0, si, in. thr, dy,dx; 
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int lstknt,x, IC,FC,LC; 
if(FWD=DIR) { 

lstknt=bgni_; IC=bgni_+l; FC=endi_+l; 
} 

5 else { 

lstknt=endi_; IC-endi_-l; FC=bgni_-l; 
} 

prvmin = sc_la(lstknt,fnr); 
LC = IO10*DIR; 
1 0 for(x=IC;x!=LC;x+=DIR) 

if(0.0 != (prvmin^sc_la(x,fnr))) 
break; 

K = (prvmin/3.0)/(exp(l .0)-l .0); 
for(int snr=IC;snr!=FC;snrf=DIR) { 
15 thr = prvmin + K*(exp(++cnt/TAU)- 1 .0); 
if(0.0=(newmin=scJa(snr,fnr))) 
newmin = thr+1.0; 
if(newmin<=nhr || cnt>=100.0) { 
if(cnt>=!00.0 && thr<newmin) 
20 newmin = thr; 

dy = newmin-prvmin; 
dx = double(snr-lstknt); 
si = dy/dx; 

in = newmin - sl*snr; 
25 prvmin = newmin; 

for(x=lstknt;x!=snr;x+=DIR) { 
double v = double(x)*sl+in; 
bsl[x] - (FWD=DIR)? v: sqrt(v*bsl[x]); 
#if defined(DOC J>ATENT) 
30 if(3=fnr && 1— si) ::printf("%d %7.2f %7.2f %7.2f\n",DIR,scJa(x ) fnr),v,bsl[x]); 
#endif 
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} 

lstknt = snr; 
cnt= 1.0; 

K = (prvmin/3.0)/(exp(1.0)-1.0); 
5 } 
} 

snr -= DIR; 
if(sni=lstknt) { 
double v = double(snr)*sl+in; 
1 0 bsl[snr] = (FWD=DIR)? v: sqrt(v*bsl[snr]); 
#if defined(DOC_PATENT) 

if(3=fnr && l=sl) ::printf("%d %7.2f %7.2f %7.2f\n",DIR,sc_la(snr,fnr),v,bsl[snr]); 
#endif 
} 

15 else { 

if(thr<newmin) newmin = thr; 

dy = newmin-prvmin; 

dx = double(snr-lstknt); 

si = dy/dx; 
20 in = newmin - sl*snr; 

LC = snr+DIR; 

for(x=lstknt;x!=LC;x+=DIR) { 
double v = double(x)*sl+in; 
bsl[x] = (FWD=D1R)? v: sqrt(v*bsl[x]); 
25 #ifdefined(DOC_PATENT) 

if(3=fhr && l=sl) ::printf("%d %7.2f %7.2f %7.2f\n",DIR,sc_la(x > fnr),v,bsl[x]); 
#endif 
} 

} 

30 #ifdefined(DOC_PATENT) 

if(3=fnr && FWD!=DIR) { si = 0; exit(l); } 
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#endif 

} 

void 

Wvfm : : bestBaseline_() 
{ 

double *bsl = ::dvector(l,endi_); 
for(intfnr=l;fnr<=lanes();fnr++) { 
fbbls_( for, bsl, FWD ); 
fbbls_( fnr, bsl, BCK ); 
bslAdjust( bsl, bgni_, endi_ ); 
for(int snr=bgni_:snr<=cndi_;snr-H-) { 

double del = sc_la(snr,fnr)-bsl[snr]; 

if(del<0.0)del = 0.0; 

sc_la_set(snr,fnr, del); 

} 

} 

#ifO 

debugf AFTER BESTBASLINE"); 
#endif 

::free_dvector(bsl, 1 ,endi_); 

} 

int 

Wvfm::rawPks_( RatioBin& rb ) const 
{ 

short* ppk = new short[(endi_-bgni_+l)/2]; 
short* ptr = new short[(endi_-bgni_+l)/2]; 
double* penv = new double[l+endiJ; 
short lane, scnl, PI, PN; 
if(!ppk|| !ptr || Ipenv) 
return 0; 

for(scnl=bgni_;scnl<=endi_;scnl++) { 
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penvfscnl] = sc_la(scnl,l); 
for(lane=2;lane<=lanes();lane-H-) 
if(scja(scnl,lane)>penv[scnl]) 
penv[scnl] = sc_la(scnl,lane); 

5 } 

fsmPkdet( penv, bgni_, endi_, PI, PN, ppk, ptr); 
if(0=PN) 
return 0; 

for(short idx=Pl ; idx<=PN; idx++) { 
10 double colsmpl[4]; 
short x = ppk[idx]; 
for(lane= 1 ;lane<=lanes();lane-H-) 
colsmpl[lane-l] = scja(x,lane); 
rb.classify( colsmpl, idx ); 
15 } 

delete [] ppk; 
delete [] ptr; 
delete [] penv; 
return 1; 

20 } 
int 

Wvfin::specSep() 
{ 

int scnl, lane, mrow; 
25 if(0=bgni_ || 0=endi_ || bgni_>endi_ || 0=rows_ || 0=colsj { 
status^ = STS_CORRUPT; 
return 0; 
} 

#ifO 

30 debug("Before bestBaselineJ'); 
#endif 
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bestBaseline_(); 
#ifO 

debug(" After bestBaseline"); 
#endif 
5 if(fCalcSSTJ { 

RatioBin b{ lanes() ); 
Wvfm tmp = *this; 
if(l!=tmp.rawPks_(b)) { 
status_= STS_CORRUPT; 
10 return 0; 

} 

if(l != b.analyze()) { 
status_ = STS_CORRUPT; 
return 0; 

15 } 

for(short row=l ;row<=lancs();row-H-) 
for( short col=l ;col<=lanes();col++) 
ssm_[row-l][col-l] = b.sst(row,coI); 

} 

20 double minv[5]; 

minv[ 1 ] = minv[2] = minv[3] = minv[4] = 0.0; 
for(scnl=bgni_;scnl<=endi_;scnl++) { 
double temp [4]; 

for(mrow=0;mrow<lanes();mrow'H*) { 
25 double sum=0.0; 

for(Iane=l; lane<=lanes(); lane++) 
sum += scja(scnl,lane) * ssm_[mrow] [lane- 1 ] ; 
temp[mrow] - sum; 

if(sum<minv[mrow+l]) minv[mrow+l] = sum; 
30 } 

for(lane= 1 ;lane<=lanes();lane++) 
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scja_set(scnl,lane.temp[lane- 1 ]); 

} 

for(scnl=bgni_;scnl<=endi_;scnl-H-) 
for(lane=l;lane<=lanes();lane++) { 
double v = sc_la(scnl,lane); 

#if 1 

if(v<0.0) sc_la_set(scnl,lane,0.0); 
#else 

if(v<0.0) sc_la_set(scnl,lane, v / -minv[lane] ); 
#endif 

} 

#if 0 

debug("After SST applied"); 
#endif 
return 1; 

} 

void 

Wvfm::sort() 
{ 

int transposed = 0; 

if(rows_ > cols_) { 
transpose(); 
transposed = I ; 
} 

for(int ldx=l ; Idx <= rows_; Idx-H-) 

::qsort( &pm_[ldx][l], cols_, sizeof(double), d_cmp ); 
if(l — transposed) 

transpose(); 

} 

static void 
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bubbleswap(double* v, int* i, int il, int i2) 

{ 

double td; 
int ti; 

5 td =v[il];ti =i[il]; 
v[il] = v[i2];i[il] = i[i2]; 
v[i2] = td; i[i2] = ti; 

} 

void 

1 0 Wvfm::envelope( ShftVect const& sv ) const 

{ 

static float const MAXXBND = 2.0f; 
static float const MINXBND = l.Of; 
Wvfm* It = (Wvfm*)this; 
1 5 int scnl. SVMAX = 1 +sv.maxshft(); 
int mi. NS = scanlQ, NL = lanesQ; 



if(!pvj lt->pv_= ::dvector( 1,NS ); 
if(!pij lt->pi_= ::ivector( 1, NS ); 
20 if(!pxj lt->px_ = ::dvector( 1 , NS ); 
if(!pbj lt->pb_ = ::vector( 1, NS ); 
ifl[!pv_|| !pi_|| !px_ || !pbj{ 
lt->status_ = STS_NO_MEM; 
return; 
25 } 

for(scnl=l; scnl<SVMAX; scnl++) { 
lt->pv_[scnl] = lt->px_[scnl] = 0.0; 
lt->pb_[scnl] = O.Of; 
lt->pi_[scnl] = 0; 
30 } 

for(scnl=SVMAX; scnl<=NS; scnl++) { 
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double mx, smx, v[5]; 
int i[5]; 

v[l] = sc_la( scnl-sv.s(l), 1 ); i[l]=l; 

v[2] = sc_la( scnl-sv.s(2), 2 ); i[2]=2; 
5 v[3] = scja( scnl-sv.s(3), 3 ); i[3]=3; 

v[4] = sc_la( scnl-sv.s(4), 4 ); i[4]=4; 

if(v[ 1 ]>v[2]) bubbleswap(v,i, 1 ,2); 

if(v[2]>v[3])bubbleswap(v,i,2,3); 

if(v[3]>v(4])bubbleswap(v,i,3,4); 
1 0 if(v[ I ]>v[2]) bubbleswap(v,i, 1 ,2); 

if(v[2]>v[3])bubbleswap(v,i,2,3); 

if(v[l]>v[2])bubbleswap(v,i,l,2); 

mx = v[4]; smx = v[3]; mi=i[4]; 

lt->pv_[ scnl ] = mx; 
15 lt->pi_[ scnl ] = mi; 

if(v[l]<0.0)v[l] = 0.0; 

if(v[3] < 0.0) v[3] = 0.0; 

lt->pb_[ scnl ] = (v[4]!=v[3])? float((v[3]-v[l])/(v[4]-v[3])): 0.5f; 
if(lt->pbj scnl ] > 9.99f) lt->pb_[scnl] = 9.99f; 
20 if(smx < DBL EPSILON) smx = DBL_EPS1L0N; 
if(mx<0.1) 

mx = MINXBND + mx/sqrt(smx); 
else 
mx /= smx; 

25 if(mx > MAXXBND) mx = MAXXBND; 
else if(mx < MINXBND) mx = MINXBND; 
lt->px_[ scnl ] = mx; 
} 

} 

30 void 

Wvfm: :transpose() 
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Wvfm tmp( cols_, rows_, lnordr_ ); 
double **td; 

for(int rdx=l;rdx<=rows_;rdx++) 
5 for(int cdx= 1 icdxOcols^cdx-H-) 
tmp[cdx][rdx] = pm_[rdx][cdx]; 
rows_ = tmp.rows_; 
cols_ = tmp.cols; 
td = tmp.pm_; 
10 tmp.pm_ = pm_; 
pm_ = td; 
tmp.rows_ = cols_; 
tmp.cols_ = rows_; 

} 

15 void 

Wvfm::append( Wvfm const& rhs, int oscnl, int nscnl ) 
{ 

int NS = (rhs.endi()<rhs,scanl())? rhs.endi(): rhs.scanl(); 
Wvfm tmp( oscnl+(NS-nscnl+l ), 4, lnordr_ ); 
20 ShftVect noshift; 

int or=rows_, oc=cols_; 
double **td = tmp.pm_; 
for(int lane= 1 ;lane<=4;lane++) { 
for(int os=l josooscnUos+n-) 
25 tmp.scja_set( os, lane, scJa(os,iane) ); 

for(int ns=nscnl;ns<=NS;os++,ns-H-) 
tmp.scja_set( os, lane, rhs.sc_la(ns,lane) ); 

} 

rows_ = tmp.rows_; 
30 cols_ = tmp.cols_; 
tmp.pm_ = pm_; 
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pm_ = td; 
tmp.rows_ = or; 
tmp.cols_ = oc; 
bgni_ = tmp.bgni_; 
5 endi_ = tmp.endi; 
NS = scanl(); 

if(NULL != pvj { ::free_dvector( pv_, 1, NS ); pv_ = NULL; } 
if(NULL !- pi J { ::frcejvector( pi_, 1, NS ); pi_ « NULL; } 
if(NULL != pxj { ::free_dvector( px_, 1, NS ); px_ = NULL; } 
10 iffNULL != pb_) { ::free_vector( pb_, 1, NS ); pb_ = NULL; } 
envelope( noshift ); 

} 

void 

Wvfm;:lnordr( char const* lnordr ) 
15 { 

for(int idx=0;idx<6;idx++) 
lnordrjidx] = lnordrfidx]; 

} 

void 

20 Wvfm;:ssm( double const* pssm ) 
{ 

fCalcSST_ = 0; 
for(int idx=0; idx<4; idx++) 
for(int jdx=0; jdx<4; jdx++) 
25 ssm_[idx](jdx] = pssm[idx*4+jdx]; 

} 

void 

Wvfin::pm( double** pn, int r, int b. int e ) 

{ 

30 release(); 
pm_ = pn; 
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rows_ - r; 
bgni_ = b; 

endi_ = (e>rows_J? rows_: e; 

} 

5 void 

Wvfm::debug( char const* msg ) const 

{ 

static char const* METLUTf] = 

{ 

10 "MAXJNTEGRAL", "MIN_XBANDING" 

}; 

static char const* DSLUT[] = 

{ 

"UNKNOWN", "ABI "MDYN", "FLUOR", "TRUVEL" 

15 }; 

::printf("Wvfm @ %p\n",(void*)this); 
if(NULL != msg) 
::printf(" %s\n",msg); 
::printf("\trows=%u cols=%u bgni=%u endi=%u method=%s\n", 
20 rows_,cols_,bgni_.endi_, METLUT[method()] ); 

::printf("datasrc: [%s]\n", DSLUT[dsJ ); 
::printf("lnordr: [%s]\n",( , \0'==lnordr_[0]? "": lnordrj); 
if(ABI=ds_ || MDYN==ds_) { 
: :printf( "specsep:\n") ; 
25 for(int idx=0;idx<4;idx++) { 
::printf(" "); 

for(int jdx=0 jdx<4;jdx++) 

::printf("%9.6f",ssm_[idx][jdx]); 
::printf("\n"); 
30 } 

} 
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if(NULL = pmJ 

::printf(" pm = (null)\n"); 
else { 

int idx; 

5 ::printf(" pm @ %p\n",(void*)pmJ; 
if(rows_>cols_) 

for(idx = bgni_; idx <= endi_; idx++) { 
::printf("%4Id %1 1.61f %1 1 .6lf %1 1.6If %1 1 .61f\n", 
idx,pmjidx][l],pmjidx][2],pm_[idx][3],pmjidx][4]); 

10 } 

else for(idx = bgni_; idx <= endi_; idx++) { 
::printfr%41d %1 1.6lf %1 1.61f %1 1.61f %1 1.61f\n", 
idx,pm_[ 1 ] [idx],pm_[2] [idx] ,pm_[3] [idx],pm_[4] [idx] ); 

} 

15 } 

} 

/****** *** **************************** ***** **** ***** ******* 

* FILE: Wvfm.hxx 
20 * AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
*/ 

#ifndef _WVFM_HXX_ 

#define _WVFM_HXX_ 
25 #include <basecall/ShftVect.hxx> 

class RdrOut; 

class SegRead; 

class RatioBin; 

#ifdefined(WIN32) 
30 class _declspec (dllexport) Wvfm 

#else 
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class Wvfm 
flendif 

{ 

public: 

5 enum INTERPOLATE JB Y { _3X, _2X, _1X } ; 

enum DATASRC { UNKNOWN, ABI, MDYN, FLUOR, TRUVEL 
enum Method { MAXINTEGRAL, MIN XBANDING }; 
enum Status 
{ 

10 STS_UNINITD, 

STSJNITD, 

STSNOMEM, 

STS_NO_SIZE, 

STS_NO_PARMF, 

15 STS_NO_DATAF, 

STS_SCNL_RD_FAIL, 

STS_CORRUPT, 

STS_NO_LNORD, 

STS_IFILE_UN 
20 }; 

Wvfm(); 

Wvfm(char const* dfn,char const* pfn,char const* lnordr=NULL, 
DATASRC ds=MDYN, Method m=MAX_INTEGRAL ); 
25 Wvfm( int rows, int cols, char const* Inordr, 

DATASRC ds=MDYN, Method m=M AX INTEGRA L ); 
Wvfm( Wvfm const& rhs ); 
Wvfm const& operator=( Wvfm const& rhs ); 
~Wvfm(); 
[ 0 void lnordr( char const* Inordr ): 
void ssm( double const* pssm ); 
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void ds( DATASRC ds ) { ds_ = ds; } 
void smpiRate( INTERPOLATE^ Y ib ) { ib_ = ib; } 
void rows(int r) { rows_ = r; } 
void cols(int c) { cols_ = c; } 
5 void bgni(int b) { bgni_ = b; } 
void endi(int e) { endi_ = e; } 
void sc Ja_set(int s,int l,double v) const { 

if(rowsj>cols_) pmjs][l] = v; 

else pmjl][s] = v; 
10 } 

void sc Ja_mul(int s.int I,double v) const { 
if(rows_>cols_) pmjs][l] *= v; 
else P*njl][s] *~ v ; 

} 

15 void append( Wvfm const& rhs, int oscnl, int nscnl ); 

int scanl() const { return rows_>cols_? rows : colsj } 
int lanesQ const { return rows_<cols_? rowsj colsj } 
DATASRC ds() const { return ds_; } 

double sc_la(int s,int 1) const 

20 { return rows_>cols_? pm_[s][l]: pm JI][s]; } 

double const* ssm() const { return ssm JO]; } 



void 


envelope( ShftVect const& sv ) const; 


int 


rows() 


const { return rows_; } 


int 


cols() 


const { return colsj } 


int 


obgni() 


const { return obgni_; } 


int 


oendi() 


const { return oendi_; } 


int 


bgni() 


const { return bgnij } 


int 


endi() 


const { return endi_; } 



double envv( size_t idx ) const { return (pvj? pvjidx]: 0.0; } 
30 int envi( size J idx ) const { return (pi J? pi Jidx]: 0; } 

double xbnd( size_t idx ) const { return (px_)? pxjidx]: 0.0; } 
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float buzz( sizej idx ) const { return (pb_)? pbjidx]: O.Of; } 
Method method() const { return method_; } 
char const* lnordr() const { return lnordr_; } 
void debug(char const* msg = NULL) const; 
5 int nfeeder( RdrOut& ro, int fVb=0 ); 
int unstop(); 
void sort(); 

Status status() const { return status_; } 

private: 
1 0 int rows_; 

int cols_; 

int bgni_; 

int endi 

DATASRC ds_; 
15 double **pm_; 



double* pv_; 

int* pi; 

double* px_; 
20 float* pb_; 

Method method ; 

char lnordr_[6]; 

double ssm_[4][4]; 

Status status_; 
25 lNTERPOLATE_BY ib_; 

char fCalcSST_; 

int obgni_, 
oendi_; 

int preproc(); 

30 int nreader( int Fbw, int finalRead, SegRead& segrd ); 
void release(); 
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void bgnEnd(); 
int specSep(); 

void sc Ja_sub(int sJnt l,double v) const { 
if(rows_>cols_) pm_[s][l] -= v; 
5 else pm_[l][s] — v; 
} 

void pm( double **p, int r, int b, int e ); 
void transpose(); 
void truvelAdjust(); 
10 void leastSquareBaselineQ; 
void baseline( int N ); 
void noZeros(); 
int mdynpre(); 

double* dmscanlsum() const; 
1 5 double* dmscanlprod() const; 

double* operator[](int i) { return pm_[i]; } 

int t2tBaselineSubt J); 

int rawPks_( RatioBin& rb) const; 

void bestBaseline_(); 
20 enum DIRECTION, { BCK=-1, FWD=1 }; 

void fbblsjint fnr, double* b, DIRECTION_ D) const; 

>; 

#endif 



25 /*♦****♦*********»*********♦**********»**********************•****** 

* FILE: xtranorm.cxx - Extra Normalization to compensate for 

* a band-lite (as opposed to other band-rich) lanes. 
♦AUTHOR: Andy Marks 

* COPYRIGHT (c) 1996, University of Utah 
30 */ 

#include <basecall/mb.hxx> 
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struct SD 

{ 

double v; 
int i; 

5 }; 

double* 

W vf m : : dmscanlprod() const 

{ 

double *p; 

1 0 if(NULL != (p = ::dvector( 1 ,scanl()))) 
for(int sdx=l; sdx<=scanl(); sdx++) { 
p[sdx] = sc_la(sdx,l); 
for(int Idx=2; ldx<=lanes(); ldx-H-) 
pfsdx] *= sc_la(sdx,ldx); 

15 } 

return p; 

} 

double* 

Wvfm::dmscanlsum() const 
20 { 

double *p; 

if(NULL != (p = ::dvector(l,scanl()))) 
for(int sdx=l;sdx<==scanl();sdx-H-) { 
p[sdx] = sc_la( sdx, 1 ); 
25 for(int Idx=2;ldx<=lanes();ldx++) 

pfsdx] += sc_la( sdx, ldx ); 

} 

return p; 

} 

30 static void 

stats( double const* v. int n, double& ran. double& std ) 
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{ 

int idx; 

mn = std = 0.0; 
forOdx^l ;idx<=n;idx++) 

mn += v[idx]; 
mn /= double(n); 
for(idx : =l;idx<=n;idx-H-) { 

double del = (v[idx]-mn); 

std += del*del; 

} 

std = sqrt( std/double(n) ); 

} 

int 

Wvfm::unstop() 

{ 

int testagain = 1 ; 
while(l == testagain) { 
double *p, *s, pmn,pstd, smn,sstd; 
double pmz, smz, geoMean; 
int pmu smi; 
int ldx. sdx; 
testagain = 0; 
p = dmscanlprod(); 
if(NULL == p) return 0; 
s = dmscanlsum(); 
if(NULL == s) return 0; 
stats( p, scanl(), pmn, pstd ); 
stats( s. scanl(), smn, sstd ); 
pmz=p[l];pmi=l; 
smz=s[l); smi=l; 
if(0.0 = pstd || 0.0 == sstd){ 
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::free_dvector( p, 1, scanl() ); 
::free_dvector( s, 1, scanl() ); 
return 0; 
} 

5 for(sdx=l ; sdx<=scanl(); sdx++) { 
p[sdx] = (p[sdx)-pmn)/pstd; 
s[sdx] = (s[sdx]-smn)/sstd; 
if(p[sdx] > pmz) { pmz = p[sdx]; pmi = sdx; } 
if(s[sdx] > smz) { smz - s[sdx]; smi = sdx; } 
10 } 

::free_dvector( p ; 1, scanl() ); 
::free_dvector( s, I, scanl() ); 
ifl[10 > ::abs(pmi-smi)) { 
geoMean = ::sqrt( pmz * smz ); 
15 if(geoMean > 9.5) { 

int bgn, end; 
testagain = 1 ; 

bgn = (smi>25)?smi-25: 1; 
end = ((smi+25) < scani())? smi+25: scanl(); 
20 for(ldx= 1 ; ldx<=4; Idx++) 

for(sdx=bgn; sdx<=end; sdx++) 
sc Ja_set( sdx, ldx, 0.0 ); 

} 

} 

25 } 

return 1; 

} 

static int 

sumi2j(inti, intj) 
30 { 

int t; 
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if(i>j) { 
t = i;i=j;j=t; 

} 

returnj*(j+l)/2-i*(i-l)/2; 

} 

static int 

sd_cmp( void const* el, void const* e2 ) 

{ 

double dl = ((SD const* )el)->v; 
double d2 = ((SD const*)e2)->v; 
if(dl >d2) return 1; 
else if(dl < d2) return -1; 
else return 0; 

} 

static int 

rankordr( double* pv, int n ) 

{ 

SD* pv2; 
int idx, jdx, hi; 
pv2 = new SD[n]; 
if(NULL = pv2) 
return 0; 

for(idx=l; idx<=n; idx++) { 
pv2[idx-l].v = pv[idx]; 
pv2[idx-l].i = idx; 
} 

::qsort( pv2, n, sizeof(SD), sd_cmp ); 
for(idx=0; idx<n; ) { 
for(hi=idx+l; hi<n; hi++) 
if(pv2[idx].v != pv2[hi].v) 
break; 
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double v = doub!e(sumi2j(idx+hhi))/double(hi-idx); 
for(j dx=idx ;jdx <hi ;j dx++) 
pv2[jdx].v = v; 
idx = hi; 

} 

for(idx=0; idx<n; idx++) 
pv[ pv2[idx].i ] = pv2[idx].v; 
delete [] pv2; 
return 1; 

} 

int 

SegRead::xtranorm( Wvfm& bdproc, Wvfm const& raw, int nscanl, int pass2 ) 
{ 

ShftVect& sv = nsv(); 
int lane, sdx; 
double gain - 0.0; 
if(!pass2) { 

double bandFreq[5]; 

Wvfm rawcopy( raw ); 

Wvfm rdycopy( bdproc ); 

if(l !=rdycopy.unstop()) 
return 0; 

sv = mcalign( rdycopy, sv ); 

if(0 = pmb_->remune()) { ShftVect noshft; sv = noshft; } 

pmb_->minFreq_ = 1.0; 

rawcopy.sort(); 

bandFreq[0] = pmb_->bandAmpl JO] = 0.0; 
for(lane= 1 ;lane<=4;lane++) 

pmb_->bandAmpl J lane ] = rawcopy.sc Ja( 2000, lane ); 
rawcopy = raw; 
rawcopy.envelope( sv ); 
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for(lane= I ;lane<=4;lane++) 
bandFreqf lane ] = 0.0; 
for(sdx=1+sv.maxshft(); sdx<-nscanl; sdx++) 
bandFreqf rawcopy.envi( sdx ) ] += 1.0; 
5 for(lane= 1 ; lane<=4; lane++) { 

bandFreqf lane ] /= double(nscanl-sv.maxshft()-Hl); 
if(bandFreq[lane] < pmb_->minFreq_) 
pmb_->minFreq_ = bandFreqf pmb_->minFreqLane_ = lane ]; 

} 

10 if(l != ::rankordr( pmb_->bandAmpl_, 4 )) 
return 0; 

} 

if(pmb_->minFreq_ < 0.15) { 

gain = (pmb_->bandAmpl_[pmb_->minFreqLaneJ < 2.0)? 0.5: 0.75; 
1 5 int NS = bdproc.endi(); 

for(sdx=bdproc.bgni();sdx<NS;sdx-H-) 
bdproc.scJajmiI( sdx, pmb_->minFreqLane_, gain ); 

} 

return 1 ; 

20 } 
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While the present invention has been described and illustrated in conjunction with a 
number of specific embodiments, those skilled in the art will appreciate that variations and 
modifications may be made without departing from the principles of the invention as herein 
illustrated and described. 
5 The invention may be embodied in other specific forms without departing from the 

spirit or essential characteristics thereof. The present embodiments are to be considered in all 
respects as illustrative, and not restrictive. 
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Claims 



1 . A method for determining base identity in unprocessed nucleic acid sequencing data, 
comprising the steps of: 

receiving unprocessed input data comprising unprocessed nucleic acid sequencing 
data; 

preprocessing said input data to generate preprocessed data; 
blind deconvolving said preprocessed data to generate blind deconvolved data; 
extranormalizing said blind deconvolved data to generate extranormalized data; 
detecting peaks in said extranormalized data using peak detection means to generate 
processed data; 

editing the quality of said processed data using fuzzy logic editing means to generate 
called nucleotide sequence and at least one quality value for said called 
sequence. 



2. The method according to claim 1, wherein said preprocessing step further comprises: 
identifying Begin and End points in said unprocessed data; 

establishing a baseline in said unprocessed data; 

subtracting said baseline from said unprocessed data to generate baseline-subtracted 
data; 

separating said baseline-subtracted data to generate preprocessed data, said separating 
step comprising spectral or leakage separation; 

3. The method according to claim 1 , wherein said extranormalizing step further 

comprises: 

correcting the relatives mobility of signals in said blind deconvolved data using 
Monte Carlo alignment means. 



4. The method according to claim 3, wherein said extranormalizing step further 
comprises: 

attenuating signals which were accentuated by said blind deconvolving. 
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5. The method according to claim 1 , wherein said peak detection means comprises 
frizzy logic insertion detection means to identify and remove putative insertions in 

said extranormalized data; and 
5 fuzzy logic gapchecking means to identify putative gaps in said extranormalized data 

and inserting data in said gaps. 

6. The method according to claim 5, further comprising: 

analysing said extranormalized data with said fuzzy logic insertion detection means 
10 before and after analyzing said extranormalized data with said fuzzy logic gap 

checking means. 

7. The method according to claim 1 , 

wherein said editing means generates at least one quality value by analyzing 
1 5 characteristics of said processed data selected from the group consisting of 

band height, band width, band shape, band's left gap, band's right gap, cross- 
banding and baseline buzz. 

The method according to claim 7, 

wherein said editing means generates at least one quality value from said 

characteristics of said processed data by applying a plurality of fuzzy logic 
rules. 

The method of claim 1, 

wherein said blind deconvolving is iterative and includes at least a first narrow-band 
guess for the filter band width value and a refined second band for the filter 
band width value. 

10. A method for identifying DNA sequence in unprocessed nucleic acid sequencing data, 
30 comprising the steps of: 
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receiving unprocessed input data comprising unprocessed nucleic acid sequencing 
data; 

preprocessing said input data to generate preprocessed data; 
blind deconvolving said preprocessed data to generate blind deconvolved data; 
extranormalizing said blind deconvolved data to generate extranormalized data; 
detecting peaks in said extranormalized data to generate peak detected-data; 
identifying and removing insertions in said peak detected-data using a fuzzy logic 

insertion detection algorithm; 
identifying and filling gaps in said peak detected-data using a fuzzy logic gap 

checking algorithm; and 
producing processed sequence data. 

1 1 . The method of claim 1 0, further comprising: 

editing the quality of said processed sequence data using fuzzy logic editing means to 
generate called nucleotide sequence and at least one quality value for said 
called sequence. 

12. The method according to claim 10, wherein said preprocessing step further comprises: 
identifying Begin and End points in said unprocessed data; 

establishing a baseline in said unprocessed data; 

subtracting said baseline from said unprocessed data to generate baseline-subtracted 
data; 

separating said baseline-subtracted data to generate preprocessed data, said separating 
step comprising spectral or leakage separation; 

1 3 . The method according to claim 1 0, 

wherein said extranormalizing step further comprises: 

correcting the relatives mobility of signals in said blind deconvolved data using a 
Monte Carlo alignment. 
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14. The method according to claim 13, wherein said extranormalizing step further 
comprises: 

attenuating signals accentuated by blind deconvolving. 

5 15. The method according to claim 1 0, further comprising: 

analyzing said extranormalized data with said fuzzy logic insertion detection 

algorithm before and after analyzing said extranormalized data with said fuzzy 
logic gap checking algorithm. 

10 16. The method according to claim 1 1 , wherein said editing means generates at least one 
quality value by analyzing characteristics of said processed data selected from 
the group consisting of band height, band width, band shape, band's left gap, 
band's right gap, cross-banding and baseline buzz. 

15 17. The method according to claim 1 6, wherein said editing means generates at least one 
quality value from said characteristics of said processed data by applying a 
plurality of fuzzy logic rules. 

1 8, The method of claim 10, 

20 wherein said blind deconvolving is iterative and includes at least a first 

narrow-band guess for the filter band width value and a refined, second guess 
for the filter band width value. 

1 9. A method of determining a nucleotide sequence of a DN A molecule comprising: 
25 providing a set of lane signals encoding the migration pattern of a DN A 

molecule subjected to DNA sequence analysis to generate an input data; 
preprocessing said input data to generate preprocessed data, said preprocessing 
comprising at least one of the following steps: 
identifying Begin and End points, 
30 subtracting baseline noise, 
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spectrally separating said input data using a separation 
matrix to correct for spectral cross-talk, and 

leakage separating said input data using a separation 
matrix to correct for lane leakage; 
5 blind deconvolving said preprocessed data to generate blind deconvolved data, said 

blind deconvolving deblurring signals in said preprocessed data and 
normalizing signal amplitudes, said blind deconvolving using an iterative filter 
band width algorithm; 
extranormalizing said blind deconvolved data to generate extranormalized data, said 
1 0 extranormalizing including at least one of the following steps: 

correcting relative signal mobility differences using a Monte 
Carlo alignment, and 

attenuating signals accentuated by blind deconvolution; 
detecting peaks in said extranormalized data to generate peak detected-data; 
1 5 identifying and removing insertions in said peak-detected data using fuzzy logic 

insertion detection algorithm; 
identifying and filling gaps in said peak-detected data using fuzzy logic gap checking 

algorithm; 
producing processed sequence data. 

20 

20. A digital computer system programmed to perform method for identifying DNA 
sequence in unprocessed nucleic acid sequencing data, said digital computer system 
including: 

a central processing unit, 
25 dynamic memory, and 

means for outputting data, 
the method comprising: 
receiving unprocessed input data comprising unprocessed nucleic acid sequencing 
data; 

30 preprocessing said input data to generate preprocessed data; 

blind deconvolving said preprocessed data to generate blind deconvolved data; 
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extranormalizing said blind deconvolved data to generate extranormalized data; 
detecting peaks in said extranormalized data to generate peak detected-data; 
identifying and removing insertions in said peak detected-data using a fuzzy logic 

insertion detection algorithm; 
identifying and filling gaps in said peak detected-data using a fuzzy logic gap 

checking algorithm; 
producing processed sequence data, and 

editing the quality of said processed data using fuzzy logic editing means to generate 
called nucleotide sequence and at least one quality value for said called 
sequence. 
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101 



Raw, Nx4 Trace Matrix 



102 



Preprocess 



i 

j 



103 



...... ..L 

* 



Ready, Nx4 Trace Matrix 



104 



Read Segment, 
2048x4 



Read Segment, 
2046x4 



( Coarse read to determine 
1 05 ■ band spacing and set 8D 
FBW; uses FBW 0 



Read Segment, ! 
2048x4 i 



106 



107 



108 



109 



Coarse read using last BD 
FBW; review band spacing 
and set new FBW. 



Coarse read using last BD 
FBW; review band spacing 
and set new FBW. 



i 



Fine read using BD FBW 
established above. 



Fine read using BD FBW 
established above. 



Fine read using BD FBW 
established above. 



Read, Aligned, 
Segment. 
2048x4 



Read. Aligned, 
Segment, 
2048x4 



Read, Aligned, 
Segment 
2048x4 



Processed and Reassembled, Nx4 Trace Matrix 



BaseQual Fuzzy Logic 



J 



110 



Final Traces, Called Sequence, and QualityValues 



Figure 1 
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201 



202 



203 



204 



205 



206 



Raw, Nx4 Trace Matrix 







Establish first and last scanlines 
within which to process the data. 



Baseline 

Establish, for each fluor, the best estimate 
of the baseline value. Subtract the 
baseline estimate. 



Spectral Separation 
Create the CHM and SST for the observed 
fluor ratios, then multiply the baseiined 
traces by the SST. 



Cubic Spline Interpolation 

(optional 2x or 3x) 







Ready. Nx4 Trace Matrix 




208 




209 




210 




Figur 2 
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301 



Read Segment, 2048xN 



302 



Blind Deconvolution 



303 



Extra Normalization 



J 



305 



306 



i — i — i — i — r 



A 



A 




304 



Peak Detection and 
Refinement 



307 



MM 
•i i: 



v 



Figur 3 
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401 



Blind Deconvolved, 2048x4 Trace 
Matrix 



406 



Ready, 2048x4 Trace Matrix 



402 



Identify, and zero, any regions 
with inordinately large and/or 
coincident bands. 



403 



Monte Carlo lane mobility 
cancellation. 



407 



408 



409 



Rank order the 97% ampl. of 
each data lane. This is a proxy 
for lane signal strength. 



Determine how frequently each 
lane dominates in amplitude.This 
is a proxy for a lane's band 
frequency. 



If a lane has fewer than 1 5%.of 
the bands, and it's signal strength 
is among the weakest, set the 
attenuation to 1/2. else set 
attenuation to 3/4. 

\ y 



404 



Attenuate BD'd 
band tight lane. 



410 


Band Light Lane 




and Gain (if applicable) 



405 



Blind Deconvolved, 2048x4 Trace Matrix, 
unstopped and attenuated if necessary. 
Plus, the alignment shift vector is known. 



Figur 4 
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501 



502 



503 



504 



505 



506 



507 



508 



Blind Deconvolved, 2048x4 Trace Matrix, 
unstopped and attenuated if necessary, 
and aligned.. 



Putative peak detection. 







Measure cross banding, instantaneous 
spacing, band height, low value, left and 
right gap to adjacent band. 







OmitOkN Fuzzy Logic 
(identify INS) 



Remove INS detected, then remeasure 
instantaneous spacing, instantaneous 
band width, left gap, band width, and 
called bases. 



GapCheck Fuzzy Logic 

(ident. DEL, propose 
INS) 

A 



Split gaps by inserting band(s), then 
reeasure cross banding, instantaneous 
spacing, band height, low value, left and 
right gap to adjacent band. 



OmitOkN Fuzzy Logic I 
(identify INS) j 

v..-.!.. 



509 



setBandStats 



Figure 5 
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601 



602 



603 



604 



Input: 

1) expected spacing curve 

2) array, band heights 

3) array, low intersect hts 

4) array, gap to L band 

5) array, gap to R band 

6) array, xbanding 



okLsp=okSpacing(Lsp,Esp) "\ 
okRsp=okSpacing(Rsp,Esp) j 
okSp = OR(okLsp,okRsp) J 



abLsp=badSpacing(Lsp,Esp) A 
abRsp=badSpacing(Rsp,Esp) | 
abSp=OR(abLsp,abRsp) j 



i— — . 

badXb=badXband(Xb) 
neglXb=neglXband{Xb) 



605 



606 



607 



608 



609 



611 



MHt=tinyHeight(HI) 
okHt=okHeight(Ht) 



i ■ 

J— 



RULE OK: 
neglXb AND okHt AND 
okSp 



RULE_N: 
badXb AND okHt AND okSp 



RULE.OMIT: 
tinyHt 



Scale Output Sets with 
each conclusion. 



Take centroid as 
conclusion 



Note: AND==MIN 
OR==MAX 



610 



OK 



OMIT 



2^. 



o 



o 

CO 



Figure 6 
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701 



702 



INPUT: 

1) Observed Spacing (osp) 

2) Expected Spacing (esp) 



Map Observed Spacing (osp) onto interval 
[0.0.. 1.0) using Expected Spacing (esp), 
forming Normalized Spacing (nsp). 

Example: with osp=17, esp=13 
nsp = rem(osp,esp) / esp 
nsp = 0.3 



Vs. 



703 



Determine membership of Normalized 
Spacing in the OkSpacing Memberhsip 
Ftn. 

Example: w nsp=0.3 membership = 0.7, 
or, for the most part the spacing is M OK\ 



704 




OkSpacing 



Figure 7 
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801 



802 



INPUT: 

1) Observed Spacing (osp) 

2) Expected Spacing (esp) 



'Map Observed Spacing (osp) onto 
interval [0.0.. 1.0] using Expected 
Spacing (esp), forming Normalized 
Spacing (nsp). 

Example: with osp=17, esp=13 
nsp = rem(osp.esp) / esp 
v nsp = 0.3 



803 



Determine membership of Normalized 
Spacing in the BadSpacing Memberhsip 
Ftn. 

Example: w nsp=0.3 membership = 0.5, 
or, for the most part the spacing is 
"somewhat Bad". 



804 




BadSpacing 



Figure 8 
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901 



Xb=2.0 Xb»2.0 




902 



INPUT: 

1) Crossbanding 
measurement, or the band 
amplitude divided by the 
amplitude of the next most 
prominent trace. 



903 



Determine membership of Absolute Cross 
Banding (Axb) in the CrossBanding 
Membership Ftns. 

Example: w Axb=1.5, badXb 
membership=0.25 and negligibleXb 
memberships .0. In other words, 
crossbanding is ok, but is not perfect. 



904 



1.00 



0.25 



badXb 



negligbileXb 




O N ^ 



1.5 



Cross Banding 



Figur 9 
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1001 



1002 





INPUT: 

Band Height, the highest 
point on the dominant 
trace. 









Determine membership of Absolute Height (Aht) 
measurement in the Height Membership Ftns. 

Example: w Aht=0.1 ( tinyHt memberships. 0 and 
okHt membership=1.0. In other words, band height 
is perfect. 




Figur 10 
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1102 



1103 



1104 

1105 

1106 
1107 

1108 



INPUT: 

1) Expected spacing curve 

2) Expected width curve 

3) Gap to Left neighbor band 

4) Width of band 

5) Sequence 



11/24 



Determine "gc-richness ' in N base 
region left of B n 



Normalize Observed B n to B^., gap 
<Ogp n ) and B^, to B fr2 gap (Ogp^) 
using expected spacing curve. 

Example: Ngp ft = (Ogp n /Egp„ - 1) 
Ngp^MOgp^/Egp,,-!) 



c 
c 



Normalize Observed B n width (Owd n ) 
and B^, width (Owd,,.,) using 
expected width curve. 

Example: Nwd n = (Owd n /Ewd n - 1) 
Nwd ft . 1 = (Owd <M /Ew d )l -1) 



bigGap„ * 
bigGap^, * 



b»gGap<Ngp n ,Egp n ) 
bigGap(Ngp n . v Egp ft ) 



medGap n • 
medGap,,, = 



medGapfNgp^EgpJ 
medGap(Ng P(Vl( Egp n ) 



smtGap n : 
smIGap,,., • 



smlGap(Ngp n( Egp n ) 
smlGapfNgp^EgpJ 



j 

) 
j 



btgWid,, * 
bigWid,,, - 



bigWid(Nwd n .Ewd ft ) 
bigWid(Nwd M ,Ewd n ) 



E 

J 



1109 



1110 



1111 



RULE.NORM: 
(bigGap n AND gcrich) OR 
(smlGap^ AND !bigWid 0 AND IbigWid^) OR 
!bigGap n 



RULE_SPLIT: 
(bigGap 0 ANO (bi^Wid^ OR bigWid n )) OR 
(bigGap n AND !smK3ap n ., AND !gcrich) 



Scale Output Sets with each rule and^ 
take centrotd as the conclusion. 



1112 



1.0 



Normal 



4r 



Split 
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Figure 11 
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Gap 



1201 




1202 



1203 



INPUT: 

1) Gap to left band, normalized with 
expected gap. 



Determine membership of normalized Gap 
measurement in the Gap Membership Ftns. 

Example: w Ngp=0.10, 

bigGap membership = 0.0 
medGap membership = 0.0 
srnlgap membership ~ 0.0 



1204 




0.10 mo . 

do dodo 



rvi in i^o 



8 



Gap Membership Ftns 



Figure 12 
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Width 



1301 




1302 



INPUT: 

1) Width of a band, normalized with Expected 
Width 



1303 



Determine membership of normalized Wdith 
measurement in the Width Membership Rn. 

Example: w Nwd=0.20, bigWid membership = 0. 



20 




Figure 13 
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1402 1407 

aa 

Low Integral Alignment Hl 9h integral Alignment 



1403 



INPUT: 

1) 2048x4 Trace Matrix 

2) Previous alignment 



1404 



Generate control structure with N 
initial alignment vectors (centered 
on previous alignment, if any) and 
their function value. The initial 

alignemnt vectors are the 
coordinates of the vertices of M 
concentric cubes. The function 

value is the integral of the 
envelope of the four traces when 
they're shifted per the alignment 
vector. 



1405 



Replace worst alignment vector 
with a moderate perturbation of 
the best alignment vector. Repeat 
this step until the the stop 
criterion is met. 



Figure 14 
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1501 



1502 



1503 

1504 

1505 
1506 
1507 



Input 

1) Band Height (ht) 

2) Cross-Banding (xb) 

3) Band Width (wd) 

4) Band Shape (sh) 

5) Baseline Buzz (bz) 

6) Band's Small Gap (Ig) 

7) Band's Large Gap (sg) 

* 

tinyh = tinyHeight(ht) 
smalh - smallHeight(ht) 
modrh = moderateHeight(ht) 
normh = normalHeight(ht) 
tallh = tallHeight(ht) 
okhght = modrh OR normh OR 
talih 



C presx - xbandPresent(xb) 
I neglx = xbandNegligibie(xb) 



J 



normw 



■ normalWidthfwd) 



goods » goodShape(sh) 
okbuz - baseiineBuzz{bz\ 



C 



= oKGap(sg; 



oksgap = 
oklgap s okGap(kj) 
okgap - oksgap OR oklgap 



J 



1508 



A=okbuz, B=normw, C=goods, D=okgap 

a=!okbuz, b-lnomrw, c=!goods, d-!okgap 

1Bad = aBCD OR AbCD OR ABcD OR ABCd 

2Bad = abCD OR aBcD OR aBCd OR AbcD OR AbCd OR 

ABcd 

3bad = abcD OR abCd OR Abed OR aBcd 
4bad = abed 
4ok = ABCD 

runfil = AbcO AND okhght AND neglx 



1509 



1510 



1511 



RULE.QUAL10: 
tinyh AND lokgap 



RULE.QUAU0: 
smalh AND presx AND !okgap 



c 



RULE.QUAL30: 
tinyh AND okgap 



Figur 15 
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1512 



1513 



1514 



1515 



1516 



1517 



RULE_QUAL40: 
(smalh OR modrh) AND (presx OR 3bad OR 4bad) 

RULEJ3UAL50: 
(smalh OR modrh) AND (Ineglx OR 2bad OR 
3f>ad) 



RULE_QUAL60: 
okhght AND (lokbuz OR Ineglx OR 
2bad) 



RULE_QUAL70: 
(okhght AND okbuz AND neglx AND 
2bad) OR 
runfil OR 
(smalh AND Ipresx AND 4ok) 



I 



RULE_QUAL80 
okhght AND okbuz AND neglx AND 
1bad 



RULE_QUAL90: 
okhght AND neglx AND 4ok 



1518 



Scale output sets w each rule and 
take centroid as the conclusion 




Figure 15 
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1601 



Input: 

1) Band height following Blind 
Deconvolution 



1602 



Determine membership of band height in the 
Height Membership Function(s). 

Example: wht=0. 18 

tinyHt =0.0, smalHt^O.O, normHt=1.0, 

tallHt=0.0 




Figure 16 



SUBSTITUTE SHEET (RULE 26) 



WO 98/11258 



PCT7US97/16933 



18/24 



1701 



1702 





Input: 

1) Cross-banding 
measurement 









Determine membership of cross banding 
(xb) in the BO Crossbanding membership 
functions. 

Example: with xb=1.4 neglx=0.33, 
!neglx=0.67, presx=0.0, !presx=1.0 



1703 



1.0 



0.33 



Xb present 



Xb negligible 




BQ Crossbanding 



Figure 17 
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1802 



1803 
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Input: 

1) Band Width = (End-Bgn)/medWD - 1 .0 







Determine membership of normalized 
band width in the Width Membership 
Function. 

Example: with wd=0.2 normw-1 .0, 
!normw=0.0 



1.0 



Normw 



0.2 



o 



Figure 18 
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Input: 

1) Band Shape, the linear correlation 
coefficient of the coefficients of a quadratic fit 
of the band in question with the coefficients of 
an 'ideal* band similarly located along the trace. 



Determine membership of normalized band 
shape in the Shape Membership function. 

Example: with sh=0.6 good s= 1.0 



1903 



1.0 



o o 
cn 



Good Shape 



Figure 19 
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2001 



2002 




Input 

1) Baseline Buzz, where bz = (Vt^-tg), 
and where t n indicates the amplitude of the 
nth smallest trace (t, tallest) 



2003 



Determine membership of baseline buzz ^ 
(bz) in the BQ Baseline Buzz membership 
functions. 

Example: with bz=0.28, okbuz=0.63, and 
!okbuz=0.37 i 




p 



o 

GO 



o 



BQ OK Baseline Buzz 



Figure 20 
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2101 



Input: 

1) Using Observed spacing (osp) and Expected 
spacing (esp) create Normalized spacing (nsp) 
using: 

nsp = rem(osp,esp)/esp 



2102 



Determine membership of normalized band spacing 
(nsp) in trie BQ OK Spacing membership function. 

Example: with osp-18, esp=15 t nsp=0.2, the 
spacing is given an unqualified "OK". okgap=1 .0 



2103 ,„ 










A o o o 

02 cj u» -nJ 



BQ OK Spacing 



Figure 21 
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Input: 




1) One row from the 4xN trace data matrix. 


1 





Establish left-to-right baseline using rising 
exponential threshold. 
THR = A 0 4(A ( /3r(exp(x-n-/tau)-1.0). 



Establish right-to-left baseline using rising 
exponential threshiold. 
THR = A n + (A n /3)*(exp(x-/tau)-1.0). 



Final baseline is geometric mean of the L- 
to-R and the R-to-L baselines. 
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Forward, Backward, and Combined. 




Baselin subtracted 
Figur 22 
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Raw, Nx4 Trace Matrix 



Break one lane of the trace data into 8 
zones and determine the maximum signal 
amplitude in each. 



Identify the two largest amplitude zones. 

: * ' =: 

If the two zones are adjacent set the Begin 
at the left-most zone maximum, and End 
to the last sample. 

; i ; 

^ - — > 

If the two zones are not adjacent set Begin 
at the left most zone maximum and End at 
the right most zone maximum. 



z 



Examine 400 samples right of Begin for a 
2nd primer peak (amplitude > 30% of 1st 
primer peak). Move Begin to 2nd peak if 
one exists. 



Move Begin right to the 1st sample which 
is below the mean of all samples between 
Begin and (Begin+End)/2. 



Move End left 350 samples. The tail end of 
the data is, at this juncture, is not worth 
reading. 



Figure 23 
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